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^ (54) Title: ASTHMA SUSCEPTIBILITY LOCUS 

^ (57) Abstract: The present invention describes a susceptibility locus which is functionally related to asthma. The locus maps within 
human chromosome 7pl5-pl4. The invention also describes a novel human gene, GPRA. The invention provides diagnostic methods 

Q and materials for analysing allelic variation in said locus and the GPRA gene. The invention also provides polypeptides encoded 
by GPRA gene and antibodies binding to said polypeptides. The invention further provides pharmaceutical compositions for the 

1^ treatment of asthma, other IgE-mediated disease, chronic obstructive pulmonary disease or cancer. 
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ASTHMA SUSCEPTIBILITY LOCUS 

FIELD OF THE INVENTION 

The present invention resides in the field of molecular genetics. 

BACKGROUND OF THE INVENTION 
We have previously mapped a susceptibility locus for asthma and high total serum 
Immunoglobulin E level (IgE) to a large region spanning -19 cM of chromosome 7pl5-pl4, 
This locus was the only locus that reached genome wide significance level among the Finnish 
asthntia families (Laitinen, 2001). Linkage was further confirmed among the French-Canadian 
families recruited based on asthma and in another independent data set firom Finland recruited 
based on high IgE level (Laitinen, 2001). Daniels et al (1996) have reported among 
Australian and British families six regions of possible linkage including 7pl4-pl5 and 
established by simulations that at least some of them are likely to be true positives. Fine 
mapping of the region revealed bimodal linkage to bronchial hyperreactivity and blood 
eosinophil count at D7S484 (P=0.0003) and D7S669 (P=0.006) 63 cM apart (Leaves et al 
2002). German (Wjst et al 1999), French (Dizier et al 2000), and ItaUan families (Malerba 
et al 2000) have shown some evidence of linkage, but there are also genome scans with 
inconclusive results (Ober et al 2000; Xu et al 2000; Mathias et al 2001), some of which 
have been done in ethnic populations other than Caucasian (Yokouchi et al 2000; Xu et al 
2001). By comparing genome scans in different immune disorders, it has been suggested that 
clinically distinct autoimmune diseases may be controlled by a common set of siisceptibility 
genes (Becker et al 1998). 7pl5-pl4 has also been linked to diseases such as multiple 
sclerosis (Sawcer et al 1996) and inflammatory bowel disease (Satsangi et al 1996), and 
genomic regions homologous to human 7plS-pl4 have been linked to insulin dependent 
diabetes (Jacob et al 1992) and inflammatory arthritis (Remmers et al 1996) in rat models. 

SUMMARY OF THE INVENTION 
The present invention provides an isolated, purified asthma locus- 1 (ASTl) nucleic acid and 
a complement or a fiagment thereof The invention also provides nucleic acids comprising at 
least one single nucleotide polymorphism and/or deletion/insertion polymorphism in different 
positions in asthma locus- 1. One object of the invention is to provide vectors, host cells, 
primers, and probes comprising asthma locus- 1 nucleic acid according to the invention. The 
present invention is also related to a method for the diagnosis of a single nucleotide 
polymorphism or a deletion/insertion polymorphism in asthma locus- 1 according to the 
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invention in a human, which method comprises determining the sequence of the nucleic acid 
of the human at one or more of positions in asthma locus- 1 and determining the status of the 
human by reference to polymorphism m asthma locus- 1. The invention is also related to a kit 
for use in the diagnostics of asthma and other IgE mediated allergic diseases or in assessing 
the predisposition of an individual to asthma and other IgE mediated allergic diseases. The 
invention is further related to a method for identifying a mutation, which increases 
individual's susceptibility to develop asthma and other IgE mediated allergic diseases. The 
invention also provides a transgenic animal comprising asthma locus- 1 nucleic acid 
according to the invention. 

The invention provides an isolated GPRA polypeptide comprising an amino acid sequence 
that has at least 90% sequence identity to an amino acid sequence selected from the group 
consisting of SEQ. ID NOS: 3, 5, 7, 9, 11, 13 and 15 over the entire length of the selected 
SEQ ID No: when compared using the BLASTP algorithm with a wordlength (W) of 3, and 
the BLOSUM62 scoring matrix. 

The invention further provides an isolated GPRA polypeptide comprising at least 10 
contiguous amino acids firom amino acids 343-377 of B-long (SEQ ID N0:5). 

The invention further provides an isolated GPRA polypeptide comprising an amino acid 
sequence that has at least 80% sequence identity to an amino acid sequence selected from the 
group consisting of SEQ. ID NOS: 3, 5, 7, 9, 11, 13 and 15 over a sequence comparison 
window of at least 40 amino acids when compared using the BLASTP algorithm with a 
wordlength (W) of 3, and the BLOSUM62 scoring matrix provided that the polypeptide 
includes a variant amino acid encoded by a variant form shown in Table 7. 

The invention further provides an isolated GPRA polypeptide comprismg the amino acid 
sequence of SEQ ID NO:S provided that the sequence contain as an amino acid substitution 
of Asn instead of He at codon position 107, Arg instead of Ser at codon position 241, and/or 
Thr instead of He at codon position 366. 

The invention further provides an an isolated nucleic acid encoding a GPRA polypeptide as 
defined above. 
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The invention further provides an isolated nucleic acid that hybridizes under highly stringent 
conditions to any of SEQ ID NOS: 1, 4, 6, 8, 10, 12, and 14 without hybridizing under the 
same highly stringent conditions to SEQ ID N0:2, wherein the highly stringent conditions are 
6 X NaCl/sodium citrate (SSC) at about 45 °C for a hybridization step, followed by a wash of 
2 X SSC at 50 

The invention further provides an isolated nucleic acid having a sequence that is at least 90 % 
identical to a nucleic acid having a sequence selected from the group consisting of SEQ ID 
NO: 4, 6, 8, 10, 12, and 14 over the entire length of the selected SEQ ID NO when compared 
using the BLASTN algorithm with a wordlength (W) of 1 1, M=5, and N= -4. 

The invention further provides an isolated nucleic acid having a sequence that is at least 80 % 
identical to a nucleic acid having a sequence selected from the group consisting of SEQ ID 
NO: 1, 2, 4, 6, 8, 10, 12, and 14 over a sequence comparison window of at least 100 
nucleotides when compared using the BLASTN algorithm with a wordlength (W) of 1 1, 
M=5, and N= -4 provided that the nucleic acid includes a polymorphic site occupied by a 
variant form as shown in Table 3 or Table 7. 

The invention further provides an isolated nucleic acid having a sequence that is at least 80 % 
identical to a nucleic acid having a sequence selected from the group consisting of SEQ ID 
NO: 1, 2, 4, 6, 8, 10, 12, and 14 over a sequence comparison window of at least 100 
nucleotides when compared using the BLASTN algorithm with a wordlength (W) of 1 1, 
M=5, and N= -4 provided that the nucleic acid includes a polymorphic site occupied by a 
reference form designated with a * in Table 7. 

The invention further provides an isolated genomic DNA molecule or a minigene having at 
least one intronic sequence and encoding a GPRA polypeptide that has at least 80 % 
sequence identity to an amino acid sequence selected from the group consisting of SEQ. ID 
NOS: 3, 5, 7, 9, 1 1, 13 and 15 over a region at least 40 amino acids in length when compared 
using the BLASTP algorithm with a wordlength (W) of 3, and the BLOSUM62 scoring 
matrix . 
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The invention further provides an antibody that specifically binds to an epitope within amino 
acids 343-377 of B-long (SEQ ID N0:5) or amino acids 332-366 of B-short (SEQ ID N0:7). 

The invention further provides a method of preventing or treating asthma, other IgE mediated 
disease or cancer. The method comprises administering to a patient suffering from or at risk 
of asthma, other IgE mediated disease or cancer an effective amoimt of a modulator of a 
GPRA polypeptide comprising an amino acid sequence selected firom the group consisting of 
SEQ.IDNOS 3, 5, 7, 9,11,13. 

The invention further provides a method of identifying a modulator of a GPRA polypeptide. 
The method comprises contacting a cell expressing a GPRA polypeptide with an agent; 
and determining whether the agent modulates expression of the GPRA polypeptide and/or 
signal transduction through the GPRA polypeptide, wherein the GPRA polypeptide is defined 
as above. 

The invention further provides an method of determining risk of asthma, other IgE mediated 
disease or cancer. The method comprises determining whether the individual has a variant 
polymorphic form in a GPRA gene, wherein presence of the variant polymorphic form 
indicates risk of asthma, other IgE mediated disease or cancer. 

The invention further provides a method for identifying a polymorphic site correlated with a 
disease selected from the group consisting of asthma, other IgE-mediated disease and cancer 
or susceptibility thereto. The method comprises identifying a polymorphic site within a 
GPRA gene; and determining whether a variant polymorphic form occupying the site is 
associated with the disease or susceptibility thereto. 

The invention further provides a primer or probe nucleic acid that hybridizes under higjily 
stringent conditions to a segment of SEQ ID NO: 1, 2 or 4 or a variant form thereof differing 
fix>m SEQ ID NO: 1, 2 or 4 at a position shown in Table 3 or Table 7, wherein the segment 
includes or is immediately adjacent to a polymorphic site shown in Table 3 or Table 7. 

The invention further provides a transgenic animal comprising a GPRA and/or AAAl nucleic 
acid. 
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The invention further provides a transgenic animal disposed to develop a characteristic of 
asthma, other IgE-mediated disease or cancer in which an endogenous a GPRA gene 
encoding a cognate form of a GPRA polypeptide defined by any of SEQ ID NOS: 3, 5, 7, 9, 
11,13 and 15 is functionally disrupted to prevent expression of a gene product. 

The invention further provides a kit for use in diagnosing or assessing predisposition to 
asthma, other IgE-mediated disease or cancer. The kit comprises a container; and m the 
container, a compound, preferably labeled, capable of detecting a polymorphic form at a 
polymorphic site in a susceptibility locus for asthma as defined by SEQ ID N0:1, 2 or 4. 

The invention further provides an isolated AAAl polypeptide comprising an amino acid 
sequence that has at least at least 80% sequence identity to an amino acid sequence selected 
from the group consisting of SEQ. ID NOS: 17, 19, 21, 23, 25, 27, 29, 31, 33, 35, 37, 39 and 
41 over the entire length of the selected SEQ ID No: when compared using the BLAST? 
algorithm with a wordlength (W) of 3, and the BLOSUM62 scoring matrix. 

The invention further provides an isolated nucleic acid encoding the AAAl polypeptide as 
defined above. 

The invention further provides an isolated nucleic acid having a sequence that is at least 80% 
identical to a nucleic acid having a sequence selected from the group consisting of SEQ ID 
NO: 16, 18, 20, 22, 24, 26, 28, 30, 32, 34, 36, 38 and 40 over the entire length of the selected 
SEQ ID NO when compared using the BLASTN algorithm with a wordlength (W) of 1 1, 
M=5, and N= -4. 

The invention further provides an isolated nucleic acid having at least 20 contiguous 
nucleotides from a sequence selected from the group consisting of SEQ ID NOS: 16, 18, 20, 
22, 24, 26, 28, 30, 32, 34, 36, 38 and 40. 

The invention further provides an isolated genomic DNA molecule or a minigene having at 
least one intronic sequence and encoding an AAAl polypeptide that has at least 80 % 
sequence identity to an amino acid sequence selected from the group consisting of SEQ. ID 
NOS: 17, 19, 21, 23, 25, 27, 29, 31, 33, 35, 37, 39 and 41 over the entire length of the 
selected SEQ ID NO. when compared using the BLAST? algorithm with a wordlength (W) 
of 3, and the BLOSUM62 scoring matrix . 
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The invention further provides an antibody that specifically binds to a polypeptide selected 
from the group consisting of SEQ ID N0S:17, 19, 21, 23, 25, 27, 29, 31, 33, 35, 37, 39 and 
41. 

The invention further provides a method of preventing or treating asthma, other IgE-mediated 
disease or cancer. The method comprises administering to a patient suffering from or at risk 
of asthma, other IgE-mediated disease or cancer an effective amount of a modulator of an 
AAAl polypeptide comprising an amino acid sequence selected from the group consisting of 
SEQ. ID NOS: 17, 19, 21, 23, 25, 27, 29, 31, 33, 35, 37, 39 and 41. 

The invention further provides a method of identifying a modulator of an AAAl polypeptide. 
The method comprises contacting an AAAl polypeptide with an agent; and determining 
whether the agent binds to the AAAl polypeptide, modulates expression of the AAAl 
polypeptide or modulates activity of the AAAl polypeptide, wherein the AAAl polypeptide 
comprises an amino acid sequenced as defined by any of SEQ ID N0S:17, 19, 21, 23, 25, 27, 
29,31,33, 35, 37, 39 and 41. 

The invention further provides a method of determining risk of asthma, other IgE mediated 
disease or cancer, comprising determining whether the individual has a variant polymorphic 
form in an AAAl gene, wherein presence of the variant polymorphic form indicates risk of 
asthma, other IgE mediated disease or cancer. 

The invention further provides a method for identifying a polymorphic site correlated with a 
disease selected from the group consisting of asthma, other IgE mediated disease or cancer or 
susceptibility thereto. The method comprises identifymg a polymorphic site within an AAAl 
gene, and determining whether a variant polymorphic form occupying the site is associated 
with the disease or susceptibility thereto. 

The invention provides a primer or probe nucleic acid of nucleotides that hybridizes under 
highly stringent conditions to a segment of SEQ ID NO: 16, 18, 20, 22, 24, 26, 28, 30, 32, 34, 
36, 38 and 40 or a variant form thereof differing from SEQ ID N0S.16, 18, 20, 22, 24, 26, 
28, 30, 32, 34, 36, 38 and 40 at a single polymorphic site. 
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The invention further provides a transgenic animal disposed to develop a characteristic of 
asthma, other IgE-mediated disease or cancer in which an endogenous a AAAl gene 
encoding a cognate form of an AAAl polypeptide defined by any of SEQ ID NOS: 17, 19, 
21, 23, 25, 27, 29, 31, 33, 35, 37, 39 and 41 is functionally disrupted to prevent expression of 
a gene product. 



BRffiF DESCRIPTION OF THE DRAWINGS 
Fig. 1: Physical map across the linkage region showing the organization of exons of known 
genes and microsatellite markers genotyped (above), and the organization of genomic BAG 
clones and contigs (below). 

Fig, 2: Physical map across the critical (SNP509783-SNP638799, a total of 129,017 bp) and 
the flanking regions showing the organization of all markers genotyped in phase three 
(above), and the organization of genomic BAG clones (below). 

Fig. 3: Genomic localization of the susceptibility haplotype (ASTl, gray region) of 129 kb 
for asthma related traits in relation to verified exons of GPRA (G protein coupled receptor for 
asthma susceptibility). The cDNA probe used in the Northern blotting is shown in shaded 
gray. Locations of PGR primers used in the identification of the gene structures and nested 
PGR primers (in bold) used in cloning of the full length cDNA of splice variants A and B are 
shown. SNPs in the exons of GPRA and their potential amino acid changes are shown below. 
Both ends of the susceptibility haplotype are shown with arrows as physical positions in the 
genomic contig NT_000380. 

Figs. 4A, 4B1, 4B2, 4C, 4D, 4E, and 4F: Deduced cDNAs (SEQ ID NOS: 2, 4, 6, 8, 10, 12, 
and 14) and amino acid sequences (SEQ ID NOS: 3, 5, 7, 9, 1 1, 13, and 15) of the alternative 
splice variants (A-F) of GPRA. Exon borders are indicated by vertical lines. The initiation 
and stop codons are shown in bold. 

Figs. 5A, 5B, 5C, SD, SE and 5F: Genomic structures of the alternative splice variants (A-F) 
of GPRA, The exons (gray boxes) are depicted by numbered boxes (E1-E9), the length of the 
exons are indicated below, and the length of the separating introns above. The translation- 
initiation site and the stop codons are indicated by arrows. The white boxes indicate the 5* 
and 3 ' UTR regions and the shaded box in the B variant exon 3 shows the alternative slicing 
of the long and short form. 

Figs. 6A, 6B, 6C, 6D, 6E and 6F: Predicted 7TM structures of the alternative splice variants 
of GPRA (SEQ ID NOS: 3, 5, 7, 9, 1 1, 13, and 15). For the splice variants A and Bio^g, 
cytoplasmic loops (=cytoloop) and extracellular loops (=exoloop) are indicated above the 
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amino acid chain, transmembrane regions are underlined and the conserved amino acids, 
characteristic for the GPR family A, are bolded. For the other splice variants (Bshort-F) the 
nimiber of predicted transmembrane regions varied from one to six. 
Fig, 7: Predicted topology of the seven transmembrane structure of the protein encoded by 
the GFRA A splice variant by TMpred. 

Fig, 8: Expression of GPRA variants in the human lung epithelial carcinoma cell line NCI- 
H358. RT-PCR was performed by using variant B, and C sfpecific primers. PCR-products 
were separated on ethidium bromide stained 1.5 % agarose gel. 
Fig. 9: Expression of GPRA by Northern blot analysis widi a 470 bp cDNA probe 
(comprising the exons E2a and E2b) in human placental tissue. Four transcripts were detected 
that were approximately 6.0 kb, 4.5 kb, 1.9 kb, and 1.0 kb in size. The expression of p-actin 
is shown as control. 

Figs. lOA, lOB, IOC, lOD, lOE, and lOF: Immunohistochemical analysis of GPRA A in 
human bronchial (lOA, lOB), lung (IOC), and colon (lOD, lOE), and skin (lOD) tissues. 
Paraffin sections were stained using immunoperoxidase technique with GPRA variant A 
specific antibody (lOA, IOC - lOD) or pre-immune sera (lOB). Strong immunostaining of 
GPRA A in bronchus (lOA) and colon (lOE) was recorded in smooth muscle cell layer in 
bronchial walls (thick arrows) and in arterial walls (thin arrows) and subepithelially 
(arrowheads) in colon, respectively. In alveolar walls and alveolar macrophages (asterisk) 
intense staining was detected (IOC). In colon (lOD) and skin (lOF) tissues, basal cell layer of 
the epithelium (white arrows) stained positive for GPRA A. Staimng of the corresponding 
sections with the pre-immune serum did not show specific reactivity (lOB). Original 
magnification x400. 

Figs. IIA, IIB, lie, IID, HE, and IIF: Immunohistochemical analysis of GPRA B in 
human bronchial (1 1 A, 1 IB), lung, small intestine (1 ID), colon (1 IE), and skin (1 IF) tissues. 
Paraffm sections were stained with flie immunoperoxidase technique with GPRA variant B 
specific antibody (1 1 A, 1 IC-D) or preimmune serum (1 IB). Strong immunostaining of 
GPRA B was recorded in the epithelium of the bronchial wall (11 A), small intestine (UD), 
colon (1 IE), and skin (1 IF) tissues (arrows). In lung intense staining of alveolar wall and 
alveolar macrophages (asterisk) was detected (1 IC). Staining of the corresponding sections 
with the preinraiune serum did not show specific reactivity (1 IB), Original magnification x 
400. 
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Figs. 12A and 12B: In Western blot analysis of several human tissues, the A variant specific 
antibody recognized four major polypeptide bands at approximately 40, 42, 44, and 50 kDa 
(12A). 50 kDa band was detectable in skeletal muscle whereas uterine muscle, colon 
epithelium and prostate showed similar expression patterns of GPRA A at approximately 50, 
44, 40 kDa. The most intensive GPRA A expression was recorded in colon muscle at 42 kDa. 
The B variant specific antibody recognized two major polypeptide bands at approximately 25 
and 39 kDa with the most intensive GPRA expression in kidney (12B). 
Fig. 13: Genomic structure of AAA 1 showing both the axon (size below) and intron (size 
above) structures of the gene. AAAl (asflmia associated altematively spliced gene 1) shows 
complex splicing, two alternative starting methionines and six alternative stop codons for the 
predicted polypeptide was identified. Location of the primers used in cloning of different 
splice variants are shown below. Gray area shows the location of ASTl. 
Fig. 14: 1-Xn different splice variants of AAAl, I-XI are fiill length cDNAs. 
Fig. 15: Sequence alignments of the predicted polypeptides encoded by the I-XI splice 
variants of AAAl (CLUSTAL W program). The conserved region of the AAAl encoded 
polypeptides is in bold. 

Fig. 16: Human multiple tissue expression array (BD, Clontech) hybridized with the probe 
specific for AAAl (a mixture of multiple splice variants) (upper panel) and location of 
studied RNAs in the dot blot (lower panel). 

Fig. 17: Human fetal multiple tissue northern blot (BD, Clontech) hybridized with the probe 
specific for AAAl (a mixture of multiple splice variants) identifies several altemative 
transcripts (arrows). 

Fig. 18: Tissue specific expression AAAl. RT-PCR with the variants I, IV, VI, X, and XI 
specific primer pairs in liver (Li), lung (Lu), testis (Te), and kidney (Ki) shows different 
patterns of transcripts in different tissues. 

Fig. 19: Variable altemative splicing for AAAl depending on genotype. RT-PCR spanning 
exons 6 to 10b of AAAl was performed on lymphoblastic RNA samples genotyped for 
ASTl. Only the non-carrier of ASTl processes normal amount of the exon 6-lOb transcript, 
whereas a homozygote and heterozygotes show either absent transcript or smaller splice 
variants. Beta-actin was used as control in parallel amplifications. 
Fig. 20: Nucleic (SEQ ID N0S:16, 18, 20, 22, 24, 26, 28, 30, 32, 34, 36, 38 and 40) and 
amino acid sequences (SEQ ID N0S:17, 19, 21, 23, 25, 27, 29, 31, 33, 35, 37, 39 and 41) of 
splice variants of AAAl. 
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Figs, 21A and 21B: 21A. In vitro translation of AAAl gene using rabbit reticulocyte lysate 
translation machinery with S^^ -labelled methionine in the reaction mixture. 21B. Specificity 
of the AAAl antibody (anti-AAAl). Glutathione S-transferase (GST) -fusion proteins for 
AAAl were expressed in E. coli as GST fusion proteins (GST- AAAl). 

DEFINITIONS 

Unless defined otherwise, all technical and scientific terms have the same meaning as is 
commonly understood by one of skill in the art to which this invention is related. The 
definitions below are presented for clarity. 

"Isolated" when referred to a molecule, refers to a molecule that has been identified and 
separated and/or recovered firom a component of its natural environment and thus is altered 
"by the hand of man" fi-om its natural state. For example, an isolated polynucleotide could be 
part of a vector or a composition of matter, or could be contained within a cell, and still be 
"isolated" because that vector, composition of matter, or particular cell is not the original 
environment of the polynucleotide. The term "isolated" does not refer to genomic or cDNA 
libraries, whole cell total or mRNA preparations, genomic DNA preparations (including 
those separated by electrophoresis and transferred onto blots), sheared whole cell genomic 
DNA preparations or other compositions where the art demonstrates no distinguishing 
features of the polynucleotide sequences of the present invention. 

**Nucleic acid", includes DNA molecules (e.g. cDNA or genomic DNA), RNA molecules 
{e.g., mKNA), analogs of the DNA or RNA generated using nucleotide analogs, and 
derivatives, fi:agments and homologs. The nucleic acid or nucleic acid molecule may be 
single-stranded or double-stranded, but preferably comprises double-stranded DNA. 
Preferred nucleic acids of the invention include segments of DNA, or their complements 
including any one of the polymorphic sites shown in Table 3, 7 or 12. The segments are 
usually between 5 and 100 contiguous bases, and often range fix)m 5, 10, 12, 15, 20, or 25 
nucleotides to 10, 15, 30, 25, 20, 50 or 100 nucleotides. Nucleic acids between 5-10, 5-20, 
10-20, 12-30, 15-30, 10-50, 20-50 or 20-100 bases are common. The polymorphic site can 
occur within any position of the segment. The segments can be firom any of the allelic forms 
of DNA shown in Table 3, 7 or 12. For brevity in Table 3, the symbol T is used to represent 
both thymidine in DNA and uracil in RNA. Thus, in RNA oligonucleotides, the symbol T 
should be construed to indicate a uracil residue. Unless otherwise apparent firom the context. 
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reference to a SEQ ID NO. of the invention, refers to the strand shown in the sequence listing 
for that SEQ ID NO., the perfect complementary strand thereof or a duplex of the two. Thus, 
for example, reference to primers hybridizing to SEQ ID N0:1 includes primers hybridizing 
to the strand shown in the sequence listing and the complementary strand. 

"Isolated nucleic acid" is separated from other nucleic acid molecules which are present in 
the natural source of the nucleic acid. Preferably, an isolated nucleic acid is free of sequences 
that naturally flank the nucleic acid {i.e, sequences located at the 5'- and S'-termini of the 
nucleic acid) in the genomic DNA of the organism from which the nucleic acid is derived. 
For example, in various embodiments, isolated asthma locus-1 molecxiles can contain less 
than about 5 kb, 4 kb, 3kb, 2 kb, 1 kb, 0.5 kb or 0.1 kb of nucleotide sequences which 
naturally flank the nucleic acid molecule in genomic DNA of the cell/tissue from which the 
nucleic acid is derived (e.g., brain, heart, liver, spleen, etc.). Moreover, an isolated nucleic 
acid molecule, such as a cDNA molecule, can be substantially free of other cellular material 
or culture medium when produced by recombinant techniques, or of chemical precursors or 
other chemicals when chemically synthesized. 

The term "fragment" or "segment" as applied to a longer nucleic acid, may ordinarily be at 
least about 10 contiguous nucleotides of the longer nucleic acid in length, typically, at least 
about 20 contiguous nucleotides, more typically, from about 20 to about 50 contiguous 
nucleotides, preferably at least about 50 to about 100 contiguous nucleotides, even more 
preferably at least about 100 contiguous nucleotides to about 300 contiguous nucleotides, yet 
even more preferably at least about 300 to about 400, and most preferably, the nucleic acid 
fragment will be greater than about 500 contiguous nucleotides in length. 

"Oligonucleotide" comprises a series of linked nucleotide residues, which oligonucleotide 
has a sufficient number of nucleotide bases to be used in a PGR reaction or other application. 
A short oligonucleotide sequence may be based on, or designed from, a genomic or cDNA 
sequence and is used to amplify, confirm, or reveal the presence of an identical, similar or 
complementary DNA or RNA in a particular cell or tissue. Oligonucleotides comprise 
portions of a nucleic acid. 

"Variant" refers to a polynucleotide differing from the polynucleotide of the present 
invention, but retaining essential properties thereof Generally, variants are overall closely 
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similar, and, in many regions, identical to the polynucleotide of the present invention. Allelic 
variants of a gene refer to variant forms of the same gene between different individuals of the 
same species. The first identified allelic form is arbitrarily designated as the reference form 
and other allelic forms are designated as alternative or variant alleles. The allelic form 
occurring most fi'equently in a selected population is sometimes referred to as the wildtype 
form. Diploid organisms may be homozygous or heterozygous for allelic forms. A diallelic 
polymorphism has two forms. A triallelic polymorphism has three forms. Cognate forms of 
a gene refers to variation between structurally and functionally related genes between species. 
For example, the human gene showing the greatest sequence identity and closest functional 
relationship to a mouse gene is the human cognate form of the mouse gene. Thus, for 
example, the invention includes primate, bovine, ovine, murine, and avian cognate forms of 
the human GPRA and AAAl genes. 

A single nucleotide polymorphism occurs at a polymorphic site occupied by a single 
nucleotide, which is the site of variation between allelic sequences. The site is usually 
preceded by and followed by highly conserved sequences of the allele (e.g., sequences that 
vary in less than 1/100 or 1/1000 members of the populations). A single nucleotide 
polymoxphism usually arises due to substitution of one nucleotide for another at the 
polymorphic site. A transition is the replacement of one purine by another purine or one 
pyrimidine by another pyrimidine. A transversion is the replacement of a purine by a 
pyrimidine or vice versa. Single nucleotide polymorphisms can also arise from a deletion of 
a nucleotide or an insertion of a nucleotide relative to a reference allele. 
A set of polymorphisms means at least 2, and sometimes 5, or more of the polymorphisms 
shown in Table 3, 7 or 12. 

"Stringency". Highly stringency conditions are well-known in the art, e.g. 6xNaCl/sodium 
citrate (SSC) at about 45 **C is applied for a hybridization step, followed by a wash of 2xSSC 
at 50 °C or , alternatively hybridization at 42 °C in 5xSSC, 20 mM NaP04, pH 6.8, 50% 
formamide; and washing at 42 ®C in 0.2xSSC. These conditions can be varied empirically 
based on the length and the GC nucleotide base content of the sequences to be hybridized, or 
based on formulas for determining such variation (See, for example, Sambrook et 
a/,"Molecular Cloning: A Laboratory Manual", Second Edition, pages 9.47-9.51, Cold Spring 
Harbor, New York: Cold Spring Harbor Laboratory Press (1989)). A low stringency is 
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defined herein as being in 4-6 X SSC/0. 1-0.5% w/v SDS at 37-45 degree of C for 2-3 hours. 
Depending on the source and concentration of nucleic acid involved in the hybridization, 
alternative conditions of stringency may be employed such as mediimi stringent conditions 
which are considered herein to be 1-4 X SSC/0.25-0.5% w/v SDS at 45 degree of Celsius for 
2-3 hours or highly stringent conditions considered herein to be 0.1-1 X SSC/0.1% w/v SDS 
at 60 degree of Celsius for 1-3 hours. 

Hybridization probes are capable of binding in a base-specific manner to a complementary 
strand of nucleic acid. Such probes include nucleic acids, peptide nucleic acids, as described 
in Nielsen etal.^ Science 254, 1497-1500 (1991). "Probes" are nucleic acid sequences of 
variable length, preferably between at least about 10 nucleotides (nt), 100 nt, or many (e.g., 
6000 nt) depending on the specific use. Often probes have 15-50 nucleotides. Probes are 
used to detect identical, similar, or complementary nucleic acid sequences. Longer length 
probes can be obtained from a natural or recombinant source, are highly specific, and much 
slower to hybridize than shorter-length oligomer probes. Probes may be single- or double- 
stranded and designed to have specificity in PCR, membrane-based hybridization 
technologies, or ELISA-like technologies. Probes also hybridize to nucleic acid molecules in 
biological samples, thereby enabling immediate applications in chromosome mapping, 
linkage analysis, tissue identification and/or typing, and a variety of forensic and diagnostic 
methods of the invention. 

The term primer refers to a single-stranded oligonucleotide capable of acting as a point of 
initiation of template-directed DNA synthesis under appropriate conditions (/.e., in the 
presence of four different nucleoside triphosphates and an agent for polymerization, such as, 
DNA or RNA polymerase or reverse transcriptase) in an appropriate buffer and at a suitable 
temperature. The appropriate length of a primer depends on the intended use of the primer 
but typically ranges fi'om 15 to 30, 40 or 50 nucleotides. Short primer molecules generally 
require cooler temperatures to form sufificiently stable hybrid complexes with the template. 
A primer need not reflect the exact sequence of the template but must be sufficiently 
complementary to hybridize with a template. The term primer site refers to the area of the 
target DNA to which a primer hybridizes. The term primer pair means a set of primers 
including a 5' upstream primer that hybridizes with the 5* end of the DNA sequence to be 
amplified and a 3', downstream primer that hybridizes with the complement of the 3* end of 
the sequence to be amplified. 
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"Vector" means any plasmid or virus encoding an exogenous nucleic acid. The term should 
also be construed to include non-plasmid and non-viral compounds which facilitate transfer 
of nucleic acid into virions or cells, such as, for example, polylysine compounds and the like. 
The vector may be a viral vector which is suitable as a delivery vehicle for delivery of the 
nucleic acid encoding the desired protein, or mutant thereof, to a cell, or the vector may be a 
non-viral vector which is suitable for the same purpose. Examples of viral and non-viral 
vectors for delivery of DNA to cells and tissues are well known in the art and are described, 
for example, in Ma etal (1997, Proc. Natl. Acad. Sci. U.S.A. 94:12744-12746). Examples of 
viral vectors include, but are not limited to, a recombinant vaccinia virus, a recombinant 
adenovirus, a recombinant retrovirus, a recombinant adeno-associated virus, a recombinant 
avian pox virus, and the like (Cranage et al, 1986, EMBO J. 5.3057-3063; Intemational 
Patent Application No. W094/17810, published August 18, 1994; Intemational Patent 
Application No, W094/23744, published October 27, 1994), Examples of non-viral vectors 
include, but are not limited to, liposomes, polyamine derivatives of DNA, and the like. 

"Recombinant polynucleotide" refers to a polynucleotide having sequences that are not 
naturally joined together. An amplified or assembled recombinant polynucleotide may be 
included in a suitable vector, and the vector can be used to transform a suitable host cell. 

"Mammal" for purposes of treatment refers to any animal classified as a mammal, including 
humans, domestic and farm animals, and zoo, sports, or pet animals, such as dogs, horses, 
cats, or cows. Preferably, the mammal is human. 

Linkage describes the tendency of genes, alleles, loci or genetic markers to be inherited 
together as a result of their location on the same chromosome, and can be measured by 
percent recombination between the two genes, alleles, loci or genetic markers that are 
physically-linked on the same chromosome. Loci occurring witibdn 50 centimorgan of each 
other are linked. Some linked markers occur within the same gene or gene cluster. 

Polymorphism refers to the occuixence of two or more genetically determined alternative 
sequences or alleles in a population. A polymorphic marker or site is the locus at which 
divergence occurs. Preferred markers have at least two alleles, each occurring at frequency 
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of greater than 1%, and more preferably greater tban 10% or 20% of a selected population. A 
poljrmoxphic locus may be as small as one base pair. 

m 

Linkage disequilibrium (LD) or allelic association means the preferential association of a 
particular allele or genetic marker with a specific allele, or genetic marker at a nearby 
chromosomal location more frequently than expected by chance for any particular allele 
frequency in the population. For example, if locus X has alleles a and b, which occur equally 
frequently, and linked locus Y has alleles c and d, which occur equally frequently, one would 
expect the haplotype ac to occur with a frequency of 0.25 in a population of individuals. If ac 
occurs more frequently, then alleles a and c are considered in linkage disequilibrium. 
Linkage disequilibrium may result from natural selection of certain combination of alleles or 
because an allele has been introduced into a population too recently to have reached 
equilibrium (random association) with between linked alleles. 

A marker in linkage disequilibrium with disease predisposing variants can be particularly 
usefiil in detecting susceptibility to disease (or association with other sub-clinical 
phenotypes) notwithstanding that the marker does not cause the disease. For example, a 
marker (X) that is not itself a causative element of a disease, but which is in linkage 
disequilibrium with a gene (including regulatory sequences) (Y) that is a causative element of 
a phenotype, can be used to indicate susceptibility to the disease in circumstances in which 
the gene Y may not have been identified or may not be readily detectable. Younger alleles 
(i.e., those arising from mutation relatively late in evolution) are expected to have a larger 
genomic segment in linkage disequilibrium. The age of an allele can be determined from 
whether the allele is shared between different human ethnic human groups and/or between 
humans and related species. 

Unless otherwise apparent from the context, any embodiment, element or feature of the 
invention can be used in combination with any other. 

"Analogs" are nucleic acid sequences that have a structure similar to, but not identical to, the 
native compound but differ from it in respect to certain components. Usually, an analog has 
the same or similar function to the native compoundAnalogs may be synthetic or from a 
different evolutionary origin and may have a similar or opposite metabolic activity compared 
to wild type. 
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Derivatives and analogs may be full length or other than full length. Derivatives or analogs of 
the nucleic acids of the invention include, nucleic acids that are substantially identical to an 
exemplified SEQ ID NO. and/or capable ofhybridizing to the complement of the sequence 
under highly stringent, moderately stringent, or low stringent conditions. 

The terms "identical" or percent "identity," in the context of two or more nucleic acids or 
polypeptides, refer to two or more sequences or subsequences that are the same or have a 
specified percentage of nucleotides or amino acid residues that are the same, when compared 
and aligned for maximxim correspondence, as measured using a sequence comparison 
algorithm such as those described below for example, or by visual inspection. 

The phrase "substantially identical," in the context of two nucleic acids or polypeptides, 
refers to two or more sequences or subsequences that have at least 80, preferably at least 
85%, more preferably at least 90%, 95%, 99% or higher nucleotide or amino acid residue 
identity, when compared and aligned for maximum correspondence, as measured using a 
sequence comparison algorithm such as those described below for example, or by visual 
inspection. Preferably, the substantial identity exists over a region of the sequences that is at 
least 40 residues (/.e., amino acids or nucleotides) in length, preferably over a longer region 
than 50 residues, more preferably at least about 90-100 residues, and most preferably the 
sequences are substantially identical over the full length of the sequences being compared, 
such as the codmg region of a nucleotide for example. For example, when a SEQ ID NO. of 
the invention serves as a reference for comparison with an object nucleic acid, the 
comparison is preferably performed over the entired length of the SEQ ID NO. of the 
invention. 

For sequence comparison, typically one sequence acts as a reference sequence, to which test 
sequences are compared. When using a sequence comparison algorithm, test and reference 
sequences are input into a computer, subsequence coordinates are designated, if necessary, 
and sequence algorithm program parameters are designated. The sequence comparison 
algorithm then calculates the percent sequence identity for the test sequence(s) relative to the 
reference sequence, based on the designated program parameters. 
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Optimal alignment of sequences for comparison can be conducted, e,g., by the local 
homology algorithm of Smith & Waterman, Adv. AppL Math. 2:482 (1981), by the homology 
alignment algorithm of Needleman & Wunsch, J. Mol Biol 48:443 (1970), by the search for 
similarity method of Pearson & Lipman, Proc. Natl Acad ScL USA 85:2444 (1988), by 
computerized implementations of these algorithms (GAP, BESTFIT, FASTA, and TFASTA 
in the Wisconsin Genetics Software Package, Genetics Computer Group, 575 Science Dr., 
Madison, WI), or by visual inspection (see generally Ausubel et al, supra). 

Another example of algorithm that is suitable for determiiiing percent sequence identity and 
sequence similarity is the BLAST algorithm, which is described in Altschul et aL, J. Mol 
Biol 215:403-410 (1990). Software for performing BLAST analyses is publicly available 
through the National Center for Biotechnology Information (http://www.ncbi.nhn,nih.gov/). 
This algorithm involves first identifying high scoring sequence pairs (HSPs) by identifying 
short words of length W in the query sequence, which either match or satisfy some positive- 
valued threshold score T when aligned with a word of the same length in a database 
sequence. T is referred to as the neighborhood word score threshold (Altschul et al, supra.). 
These initial neighborhood word hits act as seeds for initiatmg searches to find longer HSPs 
containing them. The word hits are then extended in both directions along each sequence for 
as far as the cumulative alignment score can be increased. Cumulative scores are calculated 
using, for nucleotide sequences, the parameters M (reward score for a pair of matching 
residues; always > 0) and N (penalty score for mismatching residues; always < 0). For amino 
acid sequences, a scoring matrix is used to calculate the cumulative score. Extension of the 
word hits in each direction are halted when: the cumulative alignment score falls off by the 
quantity X from its maximum achieved value; the cimiulative score goes to zero or below, 
due to the accumulation of one or more negative-scoring residue alignments; or the end of 
either sequence is reached. For identifying whether a nucleic acid or polypeptide is within the 
scope of the invention, the default parameters of the BLAST programs are suitable. The 
BLASTN program (for nucleotide sequences) uses as defeults a word length (W) of 1 1, an 
expectation (E) of 10, M=5, N=-4, and a comparison of both strands. For amino acid 
sequences, the BLASTP program uses as defaults a word length (W) of 3, an expectation (E) 
of 10, and the BLOSIJM62 scoring matrix. The TBLATN program (using protein sequence 
for nucleotide sequence) uses as defaults a word length (W) of 3, an expectation (E) of 10, 
and a BLOSUM 62 scoring matrix, (see Henikoff & Henikoff, Proa Natl Acad. Sci. USA 
89:10915 (1989)). 
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In addition to calculating percent sequence identity, the BLAST algorithm also performs a 
statistical analysis of the sknilarity between two sequences (see, e.g.^ Karlin & Altschul, 
Proc. Nan Acad ScL USA 90:5873-5787 (1993)). One measure of similarity provided by 
the BLAST algorithm is the smallest sum probability (P(N)), which provides an indication of 
the probability by which a match between two nucleotide or amino acid sequences would 
occur by chance. For example, a nucleic acid is considered similar to a reference sequence if 
the smallest sum probability in a comparison of the test nucleic acid to the reference nucleic 
acid is less than about 0.1, more preferably less than about 0.01, and most preferably less than 
about 0.001. 

Another indication that two nucleic acid sequences are substantially identical is that the two 
molecules hybridize to each other under highly stringent conditions. "Bind(s) substantially" 
refers to complementary hybridization between a probe nucleic acid and a target nucleic acid 
and embraces minor mismatches that can be accommodated by reducing the stringency of the 
hybridization media to achieve the desired detection of the target polynucleotide sequence. 
The phrase "hybridizing specifically to", refers to the bmding, duplexing, or hybridizing of a 
molecule only to a particular nucleotide sequence imder highly stringent conditions when that 
sequence is present in a complex mixture (e.g., total cellular) DNA or RNA. 

The phrases "specifically binds" refers to a binding reaction which is determinative of the 
presence of the protein in the presence of a heterogeneous population of proteins and other 
biologies. Thus, under designated conditions, a specified ligand binds preferentially to a 
particular protein and does not bind m a significant amount to other proteins present in the 
sample. A molecule such as antibody that specifically binds to a protein often has an 
association constant of at least 10^ NT* or 10^ M"^ preferably 10^ M"^ to 10^ M'\ and more 
preferably, about 10^° M"^ to 10^^ M'^ or higher. An antibody that specifically binds to one 
segment of a protein (e.g., residues 1-10) does not bind to other segments of the protein not 
included within or overlapping the designted segment. 

The term AAAl gene means nucleotides 163615-684776 of the human genomic contig 
NT_000380 and allelic or species variants thereof. 



wo 2004/056866 PCT/FI2003/000973 

19 

The term GPRA gene means nucleotides 471478-691525 of the human genomic contig 
NT_000380 and allelic or species variants thereof. 

A "pharmacological" activity means that an agent exhibits an activity in a screening system 
that indicates that the agent is or may be useful in the prophylaxis or treatment of a disease. 
The screening system can be in vitro, cellular, animal or human. Agents can be described as 
having pharmacological activity notwithstanding that further testing may be required to 
establish actual prophylactic or therapeutic utility in treatment of a disease. 

Pulmonary diseases accociated with lower airways obstruction are classified as chronic 
obstructive pulmonary disease (COPD) or asthma on the basis of the clinical picture. 
"Asthma" is a chronic inflammatory disease of the airways causing variable airflow 
obstruction that is often reversible either spontaneously or with treatment Asthma is highly 
correlated with other allergic, Ig£ mediated diseases, such as allergic rhinitis and dermatitis. 
COPD is characterized by slowly processing, mainly irreversible airway obstruction and 
decreased expiratory flow rate. 

DETAILED DESCRIPTION OF THE INVENTION 

I. General 

This invention is based in part on the discovery and characterization of a novel susceptibility 
locus for asthma and other IgE mediated diseases, two overlapping genes within the locxxs, 
termed GPRA and AAAl, and proteins encoded by the genes. The genes have opposite 
transcriptional orientations and share intronic regions although not exons. The locus maps 
within human chromosome 7pl5-pl4. 

Fig. 3 shows the structure of the human GPRA gene. The gene has nine exons, two of which 
have alternate forms, and eight nitrons. The total length of the gene from the first base of the 
first exon to the last base of the last exon is 220047 bp and the coordinates are 471478 
(beginning) 691525 (end) inNT_000380. Part of the genomic sequence of GPRA including 
part of intron 2, exon 3, intron 3, exon 4 and part of intron 4 defines a sublocus referred to as 
ASTl. This locus is 129,017 bp in length. The locus refers to the part of hxraian 
chromosome 7 having the 129,017 bp of SEQ ID N0:1 or allelic or species variants thereof. 
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Fig. 13 shows the structure of the human AAAl gene. The gene has nineteen exons, nine of 
which have alternate forms, and 18 introns. The total length of the gene from the first base of 
the first exon to the last base of the last exon is 521,161 bp and the coordinates are 163,615 
(beginning) 684776 (end) in NT_000380. Part of the genomic sequence of AAAl including 
part of intron 2, exon 3 (a and b), intron 3, exon 4 (a and b), intron 4, exon 5 (a and b), intron 
5, exon 6, intron 6, exon 7 (a and b), intron 7, exon 8, intron 8, exon 9, intron 9, exon 10 (a 
and b) and part of intron 10 occurs within the ASTl locus. 

The present application shows that the GPRA and AAAl genes and AST-1 locus within them 
are genetically linked and associated with IgE diseases, such as asthma. In addition, the 
invention provides a collection of polymorphic sites including SNPs (smgle nucleotide 
polymorphisms) and DIPs (deletion/insertion polymorphisms) within the locus. Tables 3, 7 
and 12 show variant and reference (or wildtype) forms occupying these sites. The variant 
forms are associated with asthma, other IgE mediated diseases, and/or immune-mediated 
diseases including autoimmune diseases, and/or cancer and are useful in detection, diagnosis, 
treatment and prophylaxis thereof. 

The AST-1 locus was localized by hierarchical LD mapping m a systemic search for a shared * 
haplotype among the individuals with high IgE serum level across a 19 Mb region that was 
first identified by a genome wide screen for asthma genes (Laitinen et al 2001). This 
analysis defined the AST-1 locus between the markers SNP509783 and SNP638799 and 
provided statistically significant evidence that specific haplotype of 129kb within the linkage 
peak is a risk factor for asthma related traits (SEQ ID NO: 1). The best haplotype patterns of 
polymorphic sites in GPRA (Table 8) were found with the firequency of 13-18% among the 
high IgE associated chromosomes compared to that of 3-7% among control chromosomes 
suggesting the risk ratio of 3.9 [95%CI: 1.8-8.6] for high IgE level (38/304 among affected 
vs. 7/220 control chromosomes). The best observe associations for polymorphic sites in 
AAAl reached the %! values of 8.9-13.6 All markers identified between the positions 
509,783-638,799 in the genomic contig NT_000380 were found to be in strong LD and 
therefore usefiil for diagnostic purposes. 

II. GPRA Polypeptides 



wo 2004/056866 PCT/Fm03/000973 

21 

The human GPRA gene can be esqjressed as at least seven different splice variants termed A, 
B-long, B-short, C, D, E, and F. The cDNAs encoding each form and amino acid sequence 
of each form are shown in Fig. 4 (SEQ ID NOS:2-15). The different splicing events that lead 
to the different forms are shown in Fig. 5. It can be seen that the B-long and B-short forms 
differ from the A form in that the B forms have a different C-terminus encoded by exon 9b, 
whereas the C terminus of the A form is encoded by exon 9A. The B-short form (SEQ ID 
N0S:6 and 7) differs from the B-long form in that the B-short form has a deletion of 10 
amino acids encoded by exon HI. The C-form (SEQ ID N0S:8 and 9) contains a segment 
encoded by exon 2b which is not found in other forms. The C-form lacks any segments 
encoded by exons 3-9. The D form (SEQ ID NOS: 10 and 11) lacks a segment encoded by 
exon 3, causing a frameshifl, whereby exons 4 and 5 are read in a different reading fi^me 
causing truncation within exon 5. Form E (SEQ ID NOS: 12 and 13) lacks exon 4 causing a 
frameshifl as a result of which exon 5 is read in a different reading frame causing truncation 
within exon 5. Form F (SEQ ID N0S:14 and 15) lacks exons 3 and 4. 

Fig. 6 shows the expected structural characteristics of the different splice variants. The A 
form includes 371 amino acids and has an extracellular N-terminal domain, seven 
transmembrane domains and an intracellular C-terminal domains (SEQ ED N0:3). The 
transmembrane domains are separated by alternating cytoplasmic and extracellular loops. 
The B-long form (377 amino acids, SEQ ID N0:5) has a similar structure. The predicted size 
of the extracellular N-terminal domain is about 50 amino acids anticipating suggestmg that 
the endogenous ligand that interacts with GPRA is a small molecule or peptide. The B-short, 
C, D, E and F variants of GPRA (SEQ ID NOS: 5, 7, 9, 1 1, 13, and 15, respectively) lack the 
7TM strutucture present in variants A and B-Long. 

Fig. 7 shows the arrangement of the A or B-Long form in a cell membrane. GPRA falls 
within the conserved class A of G-protein coupled receptors (GPRs) (Rana et al 2001, 
Johnson et al 2000). GPRs are a large and functionally diverse protein superfamily that form 
a seven-transmembrane (7TM) heUces bundle with alternating extracellular and intracellular 
loops (see WO 01/18206). Class A contains most well known members of the GPRs such as 
vasopressin, oxytocin, and bovine rodopsin receptors. For the latter the crystallographic 
structure is available (Palczewski et al 2000). GPRA is expressed in human tissues 
consistent with its role in asthma and IgE mediated disease. Transcripts of GPRA variants A, 
B. and C were expressed in several tissues including lung and NCI-H358 cell line that 
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represent cells of broncho-epithelial origin (Fig. 8 and 9). Based on immunohistochemistry, 
the A variant of the GPRA protein was e:q)ressed in smooth muscle cells of bronchi and 
arterial wall in human lung as subepithelial of smooth muscle cells in colon (Fig. 10) and the 
B (-long) variant in the epithelium of bronchi and colon (Fig. 11). 

A class of GPRA polypeptides can be defined with reference to the exemplary polypeptides 
defmed by SEQ ID NOS: 3, 5, 7, 9, 1 1, 13 and 15. The class includes alleUc, cognate and 
induced variants of the exemplified polypeptides. Preferred polypeptides shows substantial 
sequence identity to one or more of the exemplified polypeptides. The class also uicludes 
fragments of the exemplified polypeptides having at least 6, 10, 20, 50 100, 200, or 300 
contiguous amino acids firom one of the above SEQ ID NOS. Some firagments include one or 
more isolated domains firom one of the exemplified polypeptides, any of the seven 
transmembrane domains, any of three extracellular loops, any of three intracellular loops, an 
N-terminal domain and a C-terminal domain. Some polypeptides comprise the anuno acid of 
one of the above SEQ ID NOS. provided that up to 1, 2, 5, 10, 20, 30 or 34 amino acids can 
be inserted, deleted or substituted relative to tiie SEQ ID NO. Some GPRA polypeptides 
comprise an epitope found in one of the exemplified SEQ ID NOS. but not in ofliers. For 
example, B-Long (SEQ ID N0:5) has a C-terminal epitope of 35 amino acids not found in 
any of the other forms. Some GPRA polypeptides contain a polymorphic site shown in Table 
7 occupied by a variant form. 

Preferred GPRA polypeptides are receptors that can be activated to transduce a signal by the 
same ligand as the exempUfied GPRA polypeptide of SEQ ID NOS: 3, 5, 7, 9, 11, 13, and 15. 
Binding of a ligand or analog to a G-protein receptor induces an alteration in receptor G- 
protein interaction. The receptor G-protein interaction releases GDP specifically bound to 
the G protein and permits the binding of GTP, which activates the G protein. Activated G- 
protein dissociates firom the receptor and activates an effector protein, which in turn 
regulates intracellular levels of second messengers, such as adenyl cyclase, guanyl cyclase, 
and phospholipase C. Signal transduction can be monitored in cells transfected with a 
reporter construct consisting of several copies of the consensus cAMP response element 
(Arias et al. Nature 370, 226-229 (1994)), a minimal tk promoter, and a secreted alkaline 
phosphatase gene (Clontech) or can be monitored in changes of the concentrations of 
intracellular calcium. 
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in. AAAl Polypeptides 

The human AAAl gene can be expressed as at least thirteen different splice variants termed 
lA, IB, n, m, IVA, IVB, V, VI, Vn, Vin, IX, X, and XI. The cDNAs encoding each form 
and amino acid sequence of each form are shown in Fig. 20 (SEQ ID NOS: 16-41). The 
different splicing events that lead to the different forms are shown in Fig. 14. It can be seen 
that all splice variants have exon VI and the same core peptide sequence 
(AYVRRNAGRQFSHCNLHAHQFLVRRKQ). The AAAl gene is expressed predominantly 
in the testis, brain, placenta, lung, heart, skeletal muscle, kidney, liver, fetal liver and fetal 
lung. In addition to the variants described in Fig. 14, using 3 '-RACE we have found also 
several other variants expressed in different tissues. They all share the exon six. 

A class of AAAl polypeptides can be defined with reference to the exemplary polypeptides 
defined by SEQ ID N0S:17, 19, 21, 23, 25, 27, 29, 31, 33, 35, 37, 39 and41. The class 
includes allelic, cognate and induced variants of the exemplified polypeptides. Preferred 
polypeptides shows substantial sequence identity to one or more of the exemplified 
polypeptides. The class also includes firagments of the exemplified polypeptides having at 
least 6, 10, 20, or 50 contiguous amino acids firom one of the above SEQ ID NOS. Some 
firagments include or consist of the above core peptide sequence. Some polypeptides 
comprise the amino acid of one of the above SEQ ID NOS. provided that up to 1, 2, 5, 10, or 
20 amino acids can be inserted, deleted or substituted relative to the SEQ ID NO. Some 
AAAl polypeptides comprise an epitope found in one of the exemplified SEQ ID NOS. but 
not in others. For example, only the variant IB contains the peptide sequence encoded by 
exon m. 

IV. GPRA Nucleic Acids 

The present invention provides isolated nucleic acids encoding GPRA polypeptides. The 
nucleic acids can be, for example, genomic, cDNA, RNA or mini-gene (z.e., a hybrid of 
genomic and cDNA). Genomic and mini-gene nucleic acids contain at least one intronic 
segment from a GPRA gene, meaning the section of human chromosome 7pl5-pl4 having 
the structure shown in Fig. 3, or allelic and cognate variants thereof. Nucleic acids of the 
invention can include coding regions, intronic regions, 3* untranslated regions, 5* untranslated 
regions, 3* flanking regions, 5* flanking regions, enhancers, promoters and other regulatory 
sequences. A class of GPRA nucleic acids can be defined with reference to exemplified 
GPRA polypeptide and nucleic acid sequences. The class includes allelic, cognate and 
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induced variants of the exemplified sequences, as well as nucleotide substitutions, which due 
to the degeneracy of the genetic code, do not effect the amino acid sequence of an encoded 
polypeptide. The class includes nucleic acids that encode the GPRA polypeptides described 
above. The class also includes GPRA nucleic acids showing substantial sequence identity to 
exemplified GPRA nucleic acids (/.e., SEQ ID N0S:1, 2, 4, 6, 8, 10, 12 and 14). SEQ ID 
NO: 1 is a genomic sequence representing the part of a GPRA gene designated Ast-1 and 
shown in Fig. 3. SEQ ID N0:2 is a 1 1 13 bp cDNA encoding GPRA variant A (Fig, 4A). 
SEQ ID N0:4 is a 1132 bp cDNA encoding GPRA variant B-Long (Fig. 4B). SEQ ID 
N0S:6, 8, 10, 12, and 14 encode GPRA variants B-Short, C, D, E and F. 

The class also includes GPRA nucleic acids that hybridize under highly stringent conditions 
with at least one of the exemplified nucleic acids. Some GPRA nucleic acids of the invention 
hybridize imder highly stringent conditions with at least one of the GPRA nucleic acids of the 
invention without hybridizing under the same conditions to others. For example, some 
GPRA nucleic acids hybridize to at least one of SEQ ID N0S:1, 4, 6, 8, 10, 12 and 14 
without hybridizing to SEQ ID N0:2. Some GPRA nucleic acids include a polymorphic site 
occupied by a variant form as shown in Table 3 or Table 7. Some GPRA nucleic acids are 
genomic or minigene. Such nucleic acids contain at least one intronic sequence fi-om any of 
the intron of a GPRA gene. Inclusion of an intronic sequence can be useful for increasing 
expression levels of a nucleic acid in cells or transgenic animals. Inclusion of an intronic 
sequence fi-om introns 2, 3 or 4 is particularly useful for analyzing splice variation of the 
GPRA gene. Nucleic acids also include firagments of exemplified sequences SEQ ID N0S:1, 
4, 6, 8, 10, 12 and 14. The fi^agments typically contain up to 10, 20, 25, 50, 100, 300, 400, 
500 or 1000 or 1500 nucleic acids from any of the above SEQ ID NOS. Optionally, 
fragments include a polymorphic site shown in Table 3 or 7. The polymorphic site can be 
occupied by either a reference or variant form, 

GPRA nucleic acids can be linked to other nucleic acids with which they are not naturally 
associated such as vector or a heterologous promoter sequence. Preferred nucleic acids of the 
invention include or are immediately adjacent to at least one of the polymorphic sites shown 
in Table 3 or 7. Such nucleic acids are useful as primers or probes for detection of specific 
alleles. Other nucleic acids are useful for expressing proteins encoded by the AST-1 locus in 
cells or transgenic animals. Other nucleic acids are useful for achieving gene suppression 
either by homologous recombination to generate a knockout animal or an antisense or siRNA 
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mechanism. Nucleic acids are also useful for identifying, purifying, and isolating nucleic 
acids encoding other, non-human, mammalian forms of asthma locus- 1. 



Nucleic acids of the invention can be isolated using standard molecular biology techniques 
and the provided sequence information (Ausubel et al. In Current protocols in Molecular 
Biology, John Wiley and Sons, publishers, 1989); Sambrook et al, supra). Cognate forms 
(/.e., nucleic acids encoding asthma locus- 1 molecules derived from species other than 
human) or other related sequences {e.g., paralogs) can be obtained by low, moderate or high 
stringency hybridization with all or a portion of the particular human sequence as a probe 
using methods well known in the art for nucleic acid hybridization and cloning. 

V. AAAl Nucleic Acids 

The present invention provides isolated nucleic acids encoding AAAl polypeptides. The 
nucleic acids can be, for example, genomic, cDNA, RNA or mini-gene (/.e, a hybrid of 
genomic and cDNA). Genomic and mini-gene nucleic acids contain at least one intronic 
segment from an AAAl gene, meaning the section of human chromosome 7pl5-pl4 having 
tiie structure shown in Fig. 13, or allelic and cognate variants thereof. Nucleic acids of the 
invention can include coding regions, intronic regions, 3' untranslated regions, 5' untranslated 
regions, 3* flanking regions, 5' flanking regions, enhancers, promoters and other regulatory 
sequences. A class of AAAl nucleic acids can be defined with reference to exemplified 
AAAl polypeptide and nucleic acid sequences. The class includes allelic, cognate and 
induced variants of the exemplified sequences, as well as nucleotide substitutions, which due 
to the degeneracy of the genetic code, do not effect the ammo acid sequence of an encoded 
polypeptide. The class includes nucleic acids that encode the AAAl polypeptides described 
above. The class also includes AAAl nucleic acids showing substantial sequence identity to 
exemplified AAAl nucleic acids SEQ ID N0S:1, 16, 18, 20, 22, 24, 26, 28, 30, 32, 34, 
36, 38 and 40). SEQ ID N0:1 is a genomic sequence representing the part of an AAAl gene 
designated Ast-1 and shown in Fig. 13. SEQ ID N0S:16, 18, 20, 22, 24, 26, 28, 30, 32, 34, 
36, 38 and 40 encode cDNA splice variants shown in Fig. 20. 

The class also includes AAAl nucleic acids that hybridize imder highly stringent conditions 
with at least one of the exemplified nucleic acids. Some AAAl nucleic acids of the invention 
hybridize under highly stringent conditions with at least one of the exemplified nucleic acids 
(i.e., SEQ ID N0S:16, 18, 20, 22, 24, 26, 28, 30, 32, 34, 36, 38 and 40) without hybridizing 



wo 2004/0S6866 PCT/FI2003/000973 

26 

under the same conditions to other exemplified nucleic acids. Some AAAl nucleic acids 
include a polymorphic site occupied by a variant form as shown in Table 12. Some AAAl 
nucleic acids are genomic or minigene. Such nucleic acids contain at least one intronic 
sequence from any of the introns of an AAAl gene. Nucleic acids also include fragments of 
exemplified sequences SEQ ID N0S:16, 18, 20, 22, 24, 26, 28, 30, 32, 34, 36, 38 and 40. 
The fragments typically contain up to 10, 20, 25, 50, or 100, nucleotides from any of the 
above SEQ ID NOS. Optionally, fragments mclude a polymorphic site shown in Table 12. 
The polymorphic site can be occupied by either a reference or variant form. 

AAAl nucleic acids can be linked to other nucleic acids with which they are not naturally 
associated such as vector or a heterologous promoter sequence. Preferred nucleic acids of the 
invention include or are immediately adjacent to at least one of the polymorphic sites shown 
in Table 3 or Table 12. Such nucleic acids are useful as primers or probes for detection of 
specific alleles. Other nucleic acids are useful for expressing AAAl polypeptides in cells or 
transgenic animals. Other nucleic acids are useful for achieving gene suppression either by 
homologous recombination to generate a knockout animal or an antisense or siRNA 
mechanism. Nucleic acids are also useful for identifying, purifying, and isolating nucleic 
acids encoding other, non-human, mammalian forms of the AAAl gene. 

As well as or instead of encoding proteins, AAAl transcripts can have roles in regulating 
expression of GPRA nucleic acids. A class of noncoding (ncRNA) has been described with 
regulatory activity in gene transcription and splicing (see e.g. reviews by Michel 2002; 
Numata et al 2003). 

VI. Antibodies 

The GPRA and AAAl polypeptides of the invention are useful for generating antibodies. 
The antibodies can be polyclonal antibodies, distinct monoclonal antibodies or pooled 
monoclonal antibodies with different epitopic specificities. Monoclonal antibodies are made 
from antigen-containing fragments of the protein by standard procedures according to the 
type of antibody (see, e.g., Kohler, et al. Nature, 256:495, (1975); and Harlow & Lane, 
Antibodies, A Laboratory Manual (C.S.H.P., NY, 1988) Queen et al, Proc. Natl Acad. Set 
USA 86:10029-10033 (1989) and WO 90/07861; Dower et al, WO 91/17271 and McCafferty 
et al, WO 92/01047 (each of which is incorporated by reference for all purposes). Phage 
display technology can also be used to mutagenize CDR regions of antibodies previously 
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shown to have affinity for the peptides of the present invention. Some antibodies bind to an 
epitope present in one form of GPRA or AAAl but not others. For example, some antibodies 
bind to an epitope within amino acids 343-377 of GPRA long B (SEQ ID N0:4). Some 
antibodies specifically bind to a GPRA polypeptide containing a variant form at a 
polymorphic site shown in Table 7 without binding to polypeptides containing a reference 
form at the site. The antibodies can be purified, for example, by binding to and elution from a 
support to which the polypeptide or a peptide to which the antibodies were raised is bound. 

Antibodies of the invention are useful, for example, in screening cDNA expression libraries 
and for identifying clones containing cDNA inserts which encode structurally-related, 
inmiunocrossreactive proteins. See, for example, Aruffo & Seed, Proc. Natl Acad. ScL USA 
84:8573-8577 (1977) (incorporated herein by reference in its entirety for all purposes). 
Antibodies are also useful to identify and/or purify immunocrossreactive proteins that are 
structurally related to SEQ ID N0S:3, 5, 7, 9, 1 1, 13 and 15 and 17, 19, 21, 23, 25, 27, 29, 
31, 33, 35, 37, 39 and 41 and fragments thereof used to generate the antibody. 

VIL Polymorphisms of the Invention 

The present invention provides a collection of polymorphic sites both single nucleotide 
polymorphisms (SNPs) and deletion/insertion polymorphisms (DIPs) in the GPRA gene. 
SNPs and DIPs can be used in mapping the human genome and, when a SNP or a DIP is 
linked with a disease or condition, to clarify genetic basis of the disease or condition, in this 
particular case, at least of asthma. 

Table 3 describes 169 SNPs and 18 DIPs occurring within exons 3 and 4 and introns 2, 3 and 
4 of a GPRA gene having variant forms associated with asthma. For AAAl the 
corresponding polymorphisms are located in exons 4, 6, 9, 10a /b and introns 2, 3, 4, 5, 6, 7, 
8, 9, and 10 (Figure 13). The locus containing these polymorphisms is also referred to as 
AST- 1 (SEQ ID NO: 1). Some of the variant forms of these polymorphism may associate 
with asthma due to an effect on the splicing of GPRA and/or AAAl gene in the formation of 
a transcript. Other polymorphic sites may be in linkage disequilibrium with such sites or sites 
in coding regions that exert effects through changing the activity of GPRA or AAAl 
polypeptides. Seven different haplotypes H1-H7 were identified that explain 94.5% of the 
genetic variation detected at the AST-1 locus and are described in Table 3. Haplotype HI was 
found identical to the reference sequence NT_000380. The first colimm in Table 3 indicates 
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the types of polymorphism (e.g., SNP or insertion/deletion). The second column indicates 
the position that the polymorphism occurs in SEQ ID NO: 1 . The third column indicates the 
position of the polymorphism occurs in NT_000380. The fourth column indicates several 
contiguous nucleotides flanking the polymorphic sites and the nature of the variant (i.e., 
asthma associated) and reference/wildtype polymorphic forms. In all polymorphic sites, the 
allele present in SEQ ID NO: 1 and NT_000380 is indicated first, and the allele fix>m anotiier 
source second. Columns 5-1 1 indicate alleles present at that site in haplotypes H1-H7. 
Column 12 indicates the location of polymorphisms in GPRA and AAAl exons. In the AST- 
1 locus, there are coding polymorphism in AAAl gene (exon 6, Asn ->Ser) and one in GPRA 
gene (exon 3, Asn -> Be). Table 13 shows polymorphisms that are unique for haplotypes Hi- 
ll?, respectively, and Table 14 polymorphisms that can be employed to recognize spesific 
haplotype combinations. 



The SNPs include the following substitutions with the variant form being indicated before the 
reference form of SEQ ED NO:- 1. Position 1 Q>referably T (i.e., variant form) instead of C 
(reference form)), position 93 (preferably G instead of A), position 918 (preferably A instead 
of G), position 983 (preferably T instead of A), position 987 (preferably C instead of T), 
position 1542 (preferably T instead of C), position 1710 (preferably G instead of A), position 
1818 (preferably T instead of C), position 1927 (preferably T instead of A), position 2254 
(preferably C instead of T), position 2937 (preferably A instead of G), position 3877 
(preferably C instead of G), position 4012 (preferably A instead of C), position 4631 
(preferably C instead of T), position 4689 Q)referably G instead of C), position 4961 
(preferably G instead of A), position 5442 (preferably G instead of C), position 5634 
(preferably A instead of G), position 5850 (preferably G instead of A), position 6312 
(preferably T instead of C), position 6392 (preferably T instead of G), position 6485 
(preferably A instead of G), position 6522 (preferably G instead of C), position 6646 
(preferably G instead of A), position 6739 (preferably A instead of G), position 6760 
(preferably C instead of T), position 7125 (preferably A instead of C), position 7229 
(preferably T instead of C), position 7277 (preferably G instead of C), position 7303 
(preferably T instead of G), position 7305 (preferably C instead of G), position 7496 
(preferably T mstead of C), position 7550 (preferably A instead of G), position 8490 
(preferably T instead of C), position 9649 (preferably G instead of T ), position 10816 
(preferably C instead of T), position 11 858 (preferably G instead of A), position 12581 
(preferably C instead of G), position 16845 (preferably C instead of T). position 16893 
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(preferably C instead of T), position 16980 (preferably C instead of T), position 17147 
(preferably A instead of G), position 17209 (preferably A instead of C), position 17435 
(preferably G instead of A), position 18383 (preferably A instead of G), position 18927 
(preferably T instead of G), position 18978 (preferably G instead of A), position 19268 
(preferably C instead of G), position 19272 (preferably T instead of A), position 19360 
(preferably A instead of G), position 19452 (preferably A instead of G), position 19671 
(preferably A instead of G ), position 19712 (preferably A instead of C), position 19774 
(preferably A instead of C), position 20038 (preferably C instead of T), position 20089 
(preferably A instead of T), position 20309 (preferably A instead of G), position 20395 
(preferably C instead of T), position 20789 (preferably T instead of G), position 21850 
(preferably T instead of C ), position 22475 (preferably C instead of T), position 22493 
(preferably G instead of A), position 22715 (preferably A instead of G), position 22869 
(preferably C instead of T), position 22934 (preferably T instead of A), position 24007 
(preferably C instead of T), position 24264 (preferably T instead of G), position 24869 
(preferably C instead of T), position 26198 (preferably T instead of C), position 26356 
(preferably T instead of C), position 26675 (preferably G instead of A), position 27404 
(preferably T instead of C), position 28197 (preferably A instead of G), position 28770 
(preferably T instead of C), position 28785 (preferably A instead of G), position 28858 
(preferably C instead of T), position 28866 (preferably C instead of G), position 31224 
(preferably A instead of G), position 31910 (preferably A instead of G), position 32124 
(preferably A instead of C), position 32185 (preferably C instead of T), position 32976 
(preferably C instead of T), position 33350 (preferably A instead of C), position 33798 
(preferably A instead of G), position 34362 (preferably C instead of G), position 34716 
(preferably C instead of A), position 35559 (preferably T instead of C), position 36551 
(preferably A instead of G), position 36909 (preferably A instead of C), position 37327 
(preferably T instead of G), position 37415 (preferably A instead of G), position 37685 
(preferably G instead of A), position 37931 (preferably T instead of C), position 37959 
(preferably T instead of C), position 39314 (preferably A instead of G), position 39343 
(preferably T instead of G), position 39927 (preferably C instead of T), position 45826 
(preferably T instead of C), position 50197 (preferably T instead of G), position 50334 
(preferably A instead of G), position 50493 (preferably A instead of G), position 50632 
(preferably A instead of G), position 50835 (preferably C instead of A), position 50955 
(preferably C instead of G), position 51217 (preferably A instead of G), position 51476 
(preferably G instead of A), position 51536 (preferably T instead of C), position 51861 
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(preferably G instead of C), position 51884 (preferably T instead of G), position 51975 
(preferably C instead of G), position 52573 (preferably A instead of G), position 52776 
(preferably C instead of G), position 53803 (preferably T instead of C), position 53922 
(preferably C instead of T), position 54148 (preferably A instead of G), position 54199 
(preferably C instead of T), position 54641 (preferably C instead of G), position 54751 
(preferably C instead of T), position 55000 (preferably A instead of G), position 55134 
(preferably A instead of G), position 56683 (preferably T instead of C), position 56856 
(preferably T instead of C), position 57790 (preferably C instead of A), position 60559 
(preferably G instead of C), position 60604 (preferably T instead of A), position 61165 
(preferably A instead of G), position 64559 (preferably T instead of C), position 65171 
(preferably G instead of A), position 65857 (preferably G instead of T), position 66164 
(preferably T instead of C), position 66190 (preferably T instead of C), position 66526 
(preferably T instead of C), position 66902 (preferably G instead of A), position 
67857(preferably A instead of T), position 67919 (preferably C instead of T), position 72270 
(preferably A instead of G), position 75115 (preferably A instead of G), position 76080 
(preferably T instead of C), position 76101 (preferably C instead of G), position 81912 
(preferably T instead of A), position 82203 (preferably G instead of A), position 82332 
(preferably T instead of C), position 82922 (preferably A instead of T), position 83552 
(preferably T instead of C), position 85227 (preferably C instead of T), position 85271 
(preferably A instead of G), position 107610 (preferably C instead of T), position 110989 
(preferably C instead of T), position 1 1 1012 (preferably T instead of C), position 1 12030 
(preferably C instead of G), position 1 12037 (preferably G instead of T), position 1 12283 
(preferably T instead of A), position 1 12726 (preferably A instead of T), position 1 12859 
(preferably C instead of A), position 1 13428 (preferably G instead of C), position 1 13645 
(preferably T instead of A), position 1 13944 (preferably G instead of T), position 1 14945 
(preferably G instead of A), position 115192 (preferably G instead of C), position 115628 
(preferably C instead of T), position 1 16032 (preferably A lastead of G), position 1 16464 
(preferably A instead of G), position 1 1 65 1 5 (preferably A instead of G), position 1 1 6926 
(preferably C instead of T), position 1 17276 (preferably A instead of T), position 123667 
(preferably A instead of G), position (preferably 123770 instead G of A), position 123788 
(preferably C instead of G), and position 129017 (preferably G instead of A). 

DIPS of the invention defined relative to SEQ ID NO: 1 include: positions 184-185 
(preferably insertion of AAGATA), positions 1202-1206 (preferably deletion of TAAGT), 
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positions 6786-6817 (preferably repeat of (TAAA)?), position 6821 (preferably deletion of 
T), positions 7240-7243 (preferably deletion of ACTT), positions 7306-7308 (preferably 
deletion of TGT), positions 7334-7335 (preferably deletion of AT), positions 9012-9035 
(preferably repeat of (CT)io), positions 9199-9201(preferably deletion of TCT), : position 
9355-9356 (preferably insertion of T), positions 9782-9785 (preferably deletion of GTCT), 
positions 22122-22123 (preferably deletion of TG), positions 26929-26968 (preferably repeat 
of (TAAA)u), positions 28495-28564 (preferably repeat of (CA)i2), position 34909 
(preferably deletion of T), positions 38850-38852 (preferably deletion of CTC), positions 
51022-51049 (preferably repeat of (CA)8), and positions 52286-52287 (preferably insertion 
ofCC). 

Table 7 provide 13 additional polymorphisms occurring within exonic regions of a GPRA 
gene. The next eight columns indicate the position of a polymorphic site in SEQ ID N0:1, 2, 
4, 6, 8, 10, 12 or 14 respectively. The next column indicates a polymorphic site and flanking 
nucleotides. The form of the polymorphic site present in one or more of SEQ ID NOS: 1, 2, 
4, 6y 8, 10, 12, and 14 is shown first and the form present in another source is indicated 
second. In all cases except those indicated by a the polymorphic form indicated first is the 
wildtype or reference form and the polymorphic form indicated second is the asthma or IgE 
associated form. In the three polymorphic sites indicated with a *, the variant form of the 
polymorphic site is found in NT_000380 (and SEQ ID N0:1, 2, 4, 6, 8, 10, 12 and/or 14), 
and the reference or wildtype allele is found in a different individual. The last column of the 
table shows whether an amino acid change occurs due to the polymorphisms. The positions 
of these polymorphisms defined with respect to SEQ ID N0:2 are: position 448 (T (variant) 
preferably to A (wildtype or reference), position 776 (C preferably to T), position 851 (C 
preferably to G), position 1 159 (G preferably to A), position 1 199 (T preferably to C), and 
position 1529 (C preferably to T); or in any one of the following positions as defmed by SEQ 
ID N0:4: position 1206 (A preferably to C), position 1225 (T preferably to C), position 1330 
(T preferably to A), and position 1338 (G preferably to A); or m any one of the following 
positions as defined by SEQ ID N0:8: position 585 (T preferably to C), position 655 (C 
preferably to A), and position 681 (A preferably to G). 

Table 8 shows the five polymorphic sites within Table 7 that showed the highest association 
with high serum IgE level. Two of the polymorphic sites involved nonconservative amino 
acid changes, one within the important region of exon 3 within AST-1. Two of the 
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polytnorpliic sites were within coding regions but did not cause amino acid changes. These 
sites may show strong association as a result of linkage disequilibrium with causative 
polymorphic sites. A fifth polymorphic site is within the 3* UTR of exon 9 A. This site may 
show a strong association for the same reason or because the nucleotide substitution has an 
effect on mRNA stability. 

Table 12 shows associations for five polymorphic sites occurring within the AAAl gene. At 
each site, the wildtype polymorphic form is indicated first, and the variant form second. The 
table shows relative frequency of associating haplotypes in individuals with high serum IgE 
level compared to those in control individuals with low serum IgE level. All haplotypes 
shown in the table gave chi square values of at least 8.9 (P <0.01) for association. 

Vin. Analvsis of Polvmorphisms 
A. Preparation of Samples 
Polymorphisms are detected in a target nucleic acid from an individual being analyzed. For 
assay of genomic DNA, virtually any biological sample (other than pure red blood cells) is 
suitable. For example, convenient tissue samples mclude whole blood, semen, saliva, tears, 
urine, fecal material, sweat, buccal, skin and hair. For assay of cDNA or mRNA, the tissue 
sample must be obtained from an organ in which the target nucleic acid is caressed. 

Many of the methods described below require amplification of DNA from target samples. 
This can be accomplished by e.g., PGR. See generally PGR Technology: Principles and 
Applications for DNA Amplification (ed. H.A. Erlich, Freeman Press, NY, NY, 1992); PGR 
Protocols: A Guide to Methods and Applications (eds. Innis, et al. Academic Press, San 
Diego, GA, 1990); Mattila etal. Nucleic Acids Res. 19, 4967 (1991); Eckert etal, PGR 
Methods and Applications 1, 17 (1991); PGR (eds. McPherson et al, IRL Press, Oxford); and 
U.S. Patent 4,683,202 (each of which is incorporated by reference for all piirposes). 

Other suitable amplification methods include the ligase chain reaction (LGR) (see Wu and 
Wallace, Genomics 4, 560 (1989), Landegren et al. Science 241, 1077 (1988), transcription 
amplification (Kwoh et al, Proc. Natl. Acad. Scl USA 86, 1 173 (1989)), and self-sustained 
sequence replication (GuateUi et al, Proc. Nat. Acad. Sci. USA, 87, 1874 (1990)) and nucleic 
acid based sequence amplification (NASBA). The latter two amplification methods involve 
isothermal reactions based on isothermal transcription, which produce both single stranded 
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RNA (ssRNA) and double stranded DNA (dsDNA) as the amplification products in a ratio of 
about 30 or 100 to 1, respectively. 

B. Detection of Polytnorphisms in Target DNA 
The identity of bases occupying the polymoiphic sites shown in Table 3, 7 or 12 can be 
determined in an individual a patient being analyzed) by several methods, which are 
described in turn. 

1. Single Base Extension Methods 

Single base extension methods are described by e.g., US 5,846,710, US 6,004,744, US 
5,888,81 9 and US 5,856,092. In brief, the methods work by hybridizing a primer that is 
complementary to a target sequence such that the 3' end of the primer is immediately 
adjacent to but does not span a site of potential variation in the target sequence. That is, the 
primer comprises a subsequence from the complement of a target polynucleotide terminating 
at the base that is inunediately adjacent and 5' to the polymorphic site. The hybridization is 
performed in the presence of one or more labeled nucleotides complementary to base(s)that 
may occupy the site of potential variation. For example, for a biallelic polymorphisms two 
differentially labeled nucleotides can be used For a tetraallelic polymorphism four 
differentially labeled nucleotides can be xised. In some methods, particularly methods 
employing multiple differentially labeled nucleotides, the nucleotides are dideoxynucleotides. 
Hybridization is performed under conditions permitting primer extension if a nucleotide 
complementation a base occupying the site of variation in the target sequence is present. 
Extension incoxporates a labeled nucleotide thereby generating a labeled extended primer. If 
multiple differentially labeled nucleotides are used and the target is heterozygous then 
multiple differentially labeled extended primers can be obtained. Extended primers are 
detected providing an indication of which bas(s) occupy the site of variation in the target 
polynucleotide. The methods are particularly useful for SNPs. 

2. Allele-Specific Probes 

The design and use of allele-specific probes for analyzing polymoiphisms is described by 
e.g., Saiki et al. Nature 324, 163-166 (1986); Dattagupta, EP 235,726, Saiki, WO 89/1 1548. 
Allele-specific probes can be designed that hybridize to a segment of target DNA from one 
individual but do not hybridize to the corresponding segment (corresponding segments being 
defined by maximum alignment of the sequences being compared) from another individual 
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due to the presence of different polymorphic forms in the respective segments from the two 
individuals. Hybridization conditions should be sufficiently stringent that there is a 
significant difference in hybridization intensity between alleles, and preferably an essentially 
binary response, whereby a probe hybridizes to only one of the alleles. Some probes are 
designed to hybridize to a segment of target DNA such that the polymorphic site aligns with a 
central position {e.g,, in a 15 mer at the 7 position; in a 16 mer, at either the 8 or 9 position) 
of the probe. This design of probe achieves good discrimination in hybridization between 
different allelic forms. 

Allele-specific probes are often used in pairs, one member of a pair showing a perfect match 
to a reference form of a target sequence and the other member showing a perfect match to a 
variant form. Several pairs of probes can then be inautnobilized on the same support for 
simultaneous analysis of multiple polymorphisms within the same target sequence, 

3. Tiling Arrays 

SNPs and insertion deletion polymorphism can be detected by hybridizing target nucleic 
acids to arrays of probes tiling a reference sequence or one or more variant forms of the 
reference sequence as described by WO 95/1 1995 (incorporated by reference in its entkety 
for all purposes). 

4. Allele-Specific Amplification Methods 

An allele-specific primer hybridizes to a site on target DNA overlapping a polymorphism and 
only primes amplification of an allelic form to which the primer exhibits perfect 
complementarily. See Gibbs, Nucleic Acid Res. 17,2427-2448(1989). This primer is used 
in conjunction with a second primer which hybridizes at a distal site. Amplification proceeds 
from the two primers leadmg to a detectable product signifying the particular allelic form is 
present. A control is usually performed with a second pair of primers, one of which shows a 
single base mismatch at the polymorphic site and the odier of which exhibits perfect 
complementarily to a distal site. The single-base mismatch prevents amplification and no 
detectable product is formed. In some methods, the mismatch is included in the 3 -most 
position of the oligonucleotide aligned with the polymorphism because this position is most 
destabilizing to elongation from the primer. See, e.g., WO 93/22456. 



5. Direct-Sequencing 
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The direct analysis of the sequence of polymorphisms of the present.invention can be 
accomplished using either the dideoxy- chain termination method or the Maxam -Gilbert 
method (see Sambrook et al. Molecular Cloning, A Laboratory Manual (2nd Ed., CSHP, 
New York 1989); Zyskind et al. Recombinant DNA Laboratory Manual, (Acad. Press, 
1988)). 

6. Denaturing Gradient Gel Electrophoresis 

Amplification products generated using the polymerase chain reaction can be analyzed by the 
use of denaturing gradient gel electrophoresis. Different alleles can be identified based on 
the different sequence-dependent melting properties and electrophoretic migration of DNA in 
solution. Erlich, ed., PGR Technology, Principles and Applications for DNA Amplification, 
(W.H, Freeman and Co, New York, 1992), Chapter 7. 

7. Single-Strand Conformation Polymorphism Analysis 
Alleles of target sequences can be differentiated usmg single-strand conformation 
polymorphism analysis, which identifies base differences by alteration in electrophoretic 
migration of single stranded PGR products, as described in Orita et al, Proc. Nat Acad. Sci. 
86, 2766-2770 (1989). Amplified PGR products can be generated as described above, and 
heated or otherwise denatured, to form single stranded amplification products. Single- 
stranded nucleic acids may refold or form secondary structures which are partially dependent 
on the base sequence. The different electrophoretic mobilities of single-stranded 
amplification products can be related to base-sequence difference between alleles of target 
sequences. 

8. Other Methods 

Include oligonucleotide ligation assay or restriction fi-agment length polymorphism (RFLP) 
(see, for example, Current Protocols in Molecular Biology, eds. Ausubel et al, John Wiley & 
Sons: 1992, and Landegren et al, "Reading Bits of Genetic Information: Methods for Single- 
Nucleotide Polymorphism Analysis", Genome Research 8:769-776). 

Vn. Methods of Use 

A. Discovery of New Polymorphic Sites in AST-1 Locus 
Additional polymorphic sites beyond those hsted in Tables 3, 7 and 12 withm GPRA and 
AAAl genes and particularly, the AST-1 locus thereof can be identified by comparing a 
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GPRA or AAAl nucleic acid from different individuals with a reference sequence, such as 
SEQ. ID N0:1, 2, 4, 6, 8, 10, 12 or 14 or SEQ E) N0S:16, 18, 20, 22, 24, 26, 28, 30, 32, 34, 
36, 38 or 40. Preferably, the different individuals suffer or at risk of an immune disorder, 
such as autoimmune disease, an IgE-mediated disorder, such as asthntia, chronic obstructive 
pulmonary disease or cancer. If a new polymorphic site SNP) is found, then the link 
with a specific disease can be determined by comparing the frequency of variant polymorphic 
forms occupying this site in populations of patients with or without a disease being analyzed 
(or susceptibility thereto). A polymorphic site or region may be located in any part of the 
locus, e.g., exons, introns and promoter regions of genes. 

B. Diagnostic and Detection Methods 
The present invention further provides means for prognostic or diagnostic assays for 
determining if a subject has or is increasingly likely to develop a disease associated with the 
variation or dysfunction of the asthma locus of the invention. Such diseases include in 
addition to asthma, other IgE mediated diseases, autoimmune diseases, atopic and other 
immune diseases, chronic obstructive pulmonary disease and cancer. Basically, such assays 
comprise a detection step, wherein the presence or absence of a variant polymorphic form in 
the asthma locus-1 relative to SEQ ED N0:1, 2, 4, 6, 8, 10, 12, or 14 or SEQ ID NO: 16, 18, 
20, 22, 24, 26, 28, 30, 32, 34, 36, 38 and 40 is determined in a biological sample from the 
subject. The detection step can be performed by any ofthe methods described above. 
Analogous assays can be performed in which variations in protein sequence relative to SEQ 
ID N0S:3, 5, 7, 9, 11, 13 or 15 or SEQ ID NO: 17, 19, 21, 23, 25, 27, 29, 31, 33, 35, 37, 39 
and 41 are determined. 

In particular, the present invention is directed to a method of determining the presence or 
absence of a variant form of a polymorphic site of the invention in a biological sample from a 
hmnan for diagnosis of asthma or for assessing the predisposition of an individual to asthma, 
other IgE-mediated disease, chronic obstructive pulmonary disease or cancer. The method 
comprises determining the sequence of the nucleic acid of a human at one or more the above 
SNP or DIP positions shown in Tables 3, 7 and 12. Optionally, the sample is contacted with 
oligonucleotide primers so that the nucleic acid region containing the potential single 
nucleotide polymorphism is amplified by polymerase chain reaction prior to determining the 
sequence. 
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The polymorphic sites can be analyzed individually or in sets for diagnostic and prognostic 
purposes. The conclusion drawn from the analysis depends on the nature and number of 
polymorphic sites analyzed. Some polymorphic sites have variant polymorphic forms that 
are causative of disease. Detection of such a polymorphic form provides at least a strong 
indication of presence or susceptibility to disease. Other polymorphic sites have variant 
polymorphic forms that are not causative of disease but are in equilibrium dislinkage with a 
polymorphic form that is causative. Detection of noncausative polymorphic forms provides 
an indication of risk of presence or susceptibility to disease. Detection of multiple variant 
forms at several polymorphic sites in a GPRA and or AAAl gene and particularly the parts of 
these genes within the AST-1 locus thereof provides an indication of increased risk of 
presence or susceptibility to disease. The results from analyzing the polymorphic sites of the 
invention can be combined with analysis of other loci that associate with the same disease 
(e.g., asthma) as polymorphic sites of the invention. Alternatively, or additionally risk of 
disease can be confirmed by performing conventional medical diagnostic tests of patient 
symptoms. 

C. Clinical Trials 

The polymorphisms of the invention are also useful for conducting clinical trials of drug 
candidates for the diseases noted above particularly asthma. Such trials are performed on 
treated or control populations having similar or identical polymorphic profiles at a defined 
collection of polymorphic sites. Use of genetically matched populations eliminates or 
reduces variation in treatment outcome due to genetic factors, leading to a more accurate 
assessment of the efficacy of a potential drug. 

Furthermore, the polymorphisms of the invention may be used after the completion of a 
clinical trial to elucidated differences in response to a given treatment For example, the set of 
polymorphisms may be used to stratify the eiuroUed patients into disease sub-types or classes. 
It may further be possible to use the polymorphisms to identify subsets of patients with 
similar polymorphic profiles who have unusual (high or low) response to treatment or who do 
not respond at all (non-responders). In this way, information about the xmderlying genetic 
factors influencing response to treatment can be used in many aspects of the development of 
treatment (these range from the identification of new targets, through the design of new trials 
to product labeling and patient targeting). Additionally, the polymorphisms may be used to 
identify the genetic factors involved in adverse response to treatment (adverse events). For 
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example, patients who show adverse response may have more similar polymorphic profiles 
than would be expected by chance. This allows the early identification and exclusion of such 
individuals from treatment. It also provides information that can be used to understand the 
biological causes of adverse events and to modify the treatment to avoid such outcomes. 

Vm. Kits 

The invention also features diagnostic or prognostic kits for use in detecting the presence of a 
SNP in a GPRA and/or AAAl gene or particularly an AST-1 locus thereof in a biological 
sample. The kit provides means for the diagnostics of asthma, other IgE-mediated disease, 
chronic obstructive puhnonary disease or cancer or for assessing the predisposition of an 
individual to a disease mediated by variation or dysfiuiction of a GPRA and/or AAAl gene. 
The kit can comprise a labeled compound capable of detecting the nucleic acid of a GPRA 
and/or AAAl gene in a biological sample. The kit can also comprise nucleic acid primers or 
probes capable of hybridizing specifically to at least of portion of a GPRA gene. Preferably, 
the primer is a minisequencing primer specific to any one of above polymorphisms. The kit 
can be packaged in a suitable contamer and preferably it contains instructions for using the 
kit. 

Often, the kits contain one or more pairs of allele-specific oligonucleotides hybridizing to 
different forms of a polymorphism. In some kits, the allele-specific oligonucleotides are 
provided immobilized to a substrate. For example, the same substrate can comprise allele- 
specific oligonucleotide probes for detecting at least 10, 50 or all of the polymoiphisms 
shown in Tables 3, 7, 12, 13, and 14. Optional additional components of the kit include, for 
example, restriction enzymes, reverse-transcriptase or polymerase, the substrate nucleoside 
triphosphates, means used to label (for example, an avidin-enzyme conjugate and enzyme 
substrate and chromogen if the label is biotin), and the appropriate buffers for reverse 
transcription, PGR, or hybridization reactions. 

IX. Transgenic Animals and Cells 

The present invention also makes available plasmids and vectors comprising the nucleic acid 
of the invention, which constructs can be used when studying biological activities of the 
AST-1 locus. The selecting of a suitable plasmid or vector for a certain use is within the 
abilities of a skilled artisan. As the host cell may be any prokaryotic or eukaryotic cell, a 
plasmid or vector comprising the locus can be used in the duplication of the locus (see, for 
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example, Sambrook et a/,"Molecular Cloning: A Laboratory Manual", Second Edition, Cold 
Spring Harbor, New York: Cold Spring Harbor Laboratory Press (1989)). 

Nucleic acids of the invention can be used to generate either transgenic animals or "knock 
out" animals which, in turn, are useful in the development and screening of therapeutically 
useful reagents. A transgenic animal (e.g., a bovine, pig, sheep, rabbit, rat, mouse or other 
rodent) is an animal having cells that contain a transgene, which transgene was introduced 
into the animal or an ancestor of the animal at an embryonic (e.g., one-cell) stage. Often all 
or substantially all of the somatic and germline cells of the transgenic animal have a copy of 
the transgene in their genome. A transgene is a DNA which is integrated into the genome of 
a cell from which a transgenic animal develops. Methods for generating transgenic animals, 
particularly animals such as mice, have become conventional in the art and are described, for 
example, in U.S. Pat. Nos. 4,736,866 and 4,870,009. Typically, a GPRA and/or AAAl 
nucleic acid is operably linked to a promoter and optionally an enhancer. The promoter and 
enhancer can be selected confer tissue specific expression. Transgenic animals that include a 
copy of a transgene introduced mto the germ line of the animal at an embryonic stage can be 
used to examine the effect of increased e3q)ression of GPRA and/or AAAl polypeptides. 
Such animals can be used as tester animals for reagents thought to confer protection firom, for 
example, diseases related to GPRA and/or AAAl . Such a transgenic animal can be treated 
with agent and a reduced incidence of a characteristic of disease compared to untreated 
animals (i.e., without the agent) bearing the transgene indicates a potential therapeutic 
intervention for the disease. Results from animal models can be conjQrmed in placebo- 
controlled clinical trials on patients with asthma, other IgB mediated disease, chronic 
obstructive pulmonary disease or cancer. 

Transgenic offspring are identified by demonstrating incorporation of the microinjected 
transgene into their genomes, preferably by preparing DNA from short sections of tail and 
analyzing by Southern blotting for presence of the transgene ("Tail Blots"). A preferred probe 
is a segment of a transgene fusion construct that is uniquely present in the transgene and not 
in the mouse genome. Altematively, substitution of a natural sequence of codons in the 
transgene with a different sequence that still encodes the same peptide yields a imique region 
identifiable in DNA and RNA analysis. Transgenic "foimder" mice identified in this fashion 
are bred with normal mice to yield heterozygotes, which are backcrossed to create a line of 
transgenic mice. Tail blots of each mouse from each generation are examined imtil the strain 
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is established and homozygous. Each successfully created founder mouse and its strain vary 
from other strains in the location and copy number of transgenes inserted into the mouse 
genome, and hence have widely varying levels of transgene expression or activity. 

Knockout animals or cells can also be made by functionally disrupting an endogenous 
cognate form of one or more genes within the AST-1 locus (i.e., GPRA and/or AAAl). 
Functional disruption means that the transgenic ammal is incapable of making a functional 
GPRA or AAAl polypeptide and/or transcript encoding the same. Inactivation of a protein 
can be achieved by forming a transgene in which a cloned variant gene is inactivated by 
insertion of a positive selection marker within the coding region. See Capecchi, Science 244, 
1288-1292 (1989). Inactivation of a transcript can be achieved by mutation within a 
promoter region. The transgene is then introduced into an embryonic stem cell, where it 
undergoes homologous recombination with an endogenous variant gene. Mice and other 
rodents are preferred animals. Such anunals provide useful drug screening systems. 

X. Methods of Screening for Drugs 

The GPRA and AAAl polypeptides and nucleic acids of the mvention of the invention are 
useful for screening for agents that modulate (e.g., agonize or antagonize) their function (e.g., 
as a G-protein coupled receptor). Such agents can be useful as drugs in compensating for 
genetic variations affecting GPRA and/or AAAl function. For example, an agonist of GPRA 
and/or AAAl function is useful in a patient having a genetic variation that decreases 
endogenous GPRA function, and an antagonist is useful in a patient having a genetic 
variation that increases endogenous GPRA and/or AAAl function. GPRA and AAAl 
polypeptides can also be used to screen known drugs to determine whether the drug has an 
incidental effect on GPRA and/or AAAl activity. A drug that has an incidental agonist 
activity should generally be avoided in patients having atypically high levels of IgE 
antibodies, and an antagonist should generally be avoided in immunosuppressed patients. 

Agents can initially be screened in a binding assay with a GPRA or AAAl polypeptides. A 
binding assay reduces the pool of initial agents to a subset that can be screened by a 
functional assay. Agents can be screened in cells transfected with nucleic acids encoding a 
GPRA and/or AAAl polypeptide of the invention or in transgenic animals transfected with a 
GPRA and/or AAAl nucleic acids and optionally and a reporter construct containing cAMP 
response element, a minimal promoter and a coding sequence for secreted alkaline 
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phosphatase. Methods for identifying agents that have a functional activity in transducing a 
signal through a cellular receptor are described by US 6,309,842. Agents for screening can 
be obtained by producing and screening large combinatorial libraries, as described above, or 
can be known drugs, or can be known ligands to related receptors to GPRA. 

Agents can also be screened for capacity to inhibit or increase expression of GPRA and/or 
AAAl nucleic acids. Suitable agents to screen include antisense polynucleotides, see 
Antisense KNA and DNA, (1988), D.A. Melton, Bd., Cold Spring Harbor Laboratory, Cold 
Spring Harbor, NY); Dagle et aL, Nucleic Acids Research, 19:1805 (1991); Uhhnann et al, 
Chem. Reviews, 90:543-584 (1990), zinc finger proteins (see WO 00/00388 and 
EP.95908614.1), and siRNAs WO 99/32619, Elbashir, EMBO J. 20, 6877-6888 (2001) and 
Nykanen et al. Cell 107, 309-321 (2001); WO 01/29058, 

Agents having pharmacological activity evidenced by cellular or animal testing can be 
subject to clinical trials in patients suffering from asthma, other IgE-mediated disease, 
chronic obstructive pulmonary disease or cancer m comparison with a placebo. 

XI. Methods of Treatment 

The invention provides methods of treating asthma, other IgE mediated disease, chronic 
obstructive puhnonaiy disease or cancer. These methods entail administering an effective 
amount of a GPRA and/or AAAl polypeptide or an agent, e.g., an antibody, that modulates 
activity of a GPRA and/or AAAl polypeptide. The agent can modulate that activity for 
example, by an effect on signal transduction through the GPRA polypeptide (usually a 
decrease) or by eflfecting expression of the GPRA and/or AAAl polypeptide (also usually a 
decrease). An agent can be administered for prophylactic and/or therapeutic treatments. A 
therapeutic amount is an amount sufficient to remedy a disease state or symptoms, or 
otherwise prevent, hinder, retard, or reverse the progression of disease or any other 
undesirable symptoms in any way whatsoever. In prophylactic applications, an agent is 
administered to a patient susceptible to or otherwise at risk of a particular disease or 
infection. Hence, a "prophylactically effective" amount is an amount sufficient to prevent, 
hinder or retard a disease state or its symptoms. In either instance, the precise amount of 
compound contained in the composition depends on the patient's state of health and weight 
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An appropriate dosage of the phannaceutical composition is detennined, for example, using 
animal studies (e.g., mice, rats) are commonly used to determine the Tnaximal tolerable dose 
of the bioactive agent per kilogram of weight. In general, at least one of the animal species 
tested is mammalian. The results from the animal studies can be extrapolated to determine 
doses for use in other species, such as hximans for example. 

The pharmaceutical compositions can be administered in a variety of different ways. 
Examples include administering a composition containing a pharmaceutically acceptable 
carrier via oral, intranasal, rectal, topical, intraperitoneal, intravenous, intramuscular, 
subcutaneous, subdermal, transdermal, intrathecal, and intracranial methods. The route of 
administration depends in part on the chemical composition of the active compound and any 
carriers. 

The components of pharmaceutical compositions are preferably of high pxmty and are 
substantially free of potentially harmfiil contaminants at least National Food (NF) 
grade, generally at least analytical grade, and more typically at least phannaceutical grade). 
To the extent that a given compound must be synthesized prior to use, the resulting product is 
typically substantially free of any potentially toxic agents, particularly any endotoxins, which 
may be present during the synthesis or purification process. Compositions for parental 
administration are also sterile, substantially isotonic and made imder GMP conditions. 
Compositions for oral administration need not be sterile or substantially isotonic but are 
usually made under GMP conditions. 

MATERIALS AND METHODS 
Families were recruited in central eastern Finland in 1994 and 1996. The methods for 

■ 

recruitment, control for population stratification, and clinical evaluation have been described 
previously in detail (Kauppi et al 1998; Laitinen et al 1997). We used self-reported asthma 
as a sampling method. Altogether 253 families were recruited, two thirds of which were trios. 
Based on retrospective verification of the disease history and the results of diagnostic tests 
(spirometry, histamine or methacholine challenge test, expiratory peak flow measurements), 
87% of the self-reported asthma patients were accqpted as verified cases (Kauppi et al 1998). 
Criteria for asthma were based on the recommendations of the American Thoracic Society 
(Dantzker, D.R. et al 1987). In addition to the 86 multiplex pedigrees included already into 
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OUT previous genome scan, 103 nuclear families with full phase information (a total of 874 
study individuals) were included to the association analysis without further selection. 

Total serum IgE level was determined by Diagnostics CAP FEIA (Kabi Pharmacia, Sweden) 
in one batch. Allergy screening was done to all study individuals at the time of the study 
(Kauppi et al 1998; Laitmen et al 1997). Based on total serum IgE level the study 
individuals were divided into two groups: high IgE responders (IgE > lOOkU/L) and low IgE 
responders (IgE < 100 kU/L), 

EXAMPLES 

EXAMPLE I 

Construction Of The Phvsical Map 
We have ordered the markers used in the analysis and estimated the physical distance 
between the markers using several pubUcly available sources as described previously by 
Polvi et al (2002). Completed chromosome 7 sequence confirms the distance and order of our 
markers 

(http://www.ncbi.nlm.nih.gov/entrez/query.fcgi?cmd=Retrieve&db=N^ 
096428&dopt=GenBank). 

EXAMPLE II 

Microsatellite Discovery and Genotvping 
To create a dense map of polymorphic markers spread evenly across the linkage region, we 
screened the publicly available genomic sequences for potentially polymorphic tandem 
repeats (Polvi et al 2002). Eighty microsatellites were genotyped in the whole data set (874 
samples) giving the average marker density of 220kb (range fix)m 904bp to 790kb) across the 
linkage region. All microsatellite markers and small insertions and deletions (Table IC) 
found in the critical region were genotyped usmg fluorescently labeled primers in gel 
electrophoresis on the ABI377 sequencer as described previously (Polvi et al 2002 and 
http://www.genome.helsinki.fi/eng/research/asthma/detail.htm). 
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EXAMPLE m 

Discovery of Genetic Variation in the Critical 
Region bv Direct Sequencinp 
To identify all sequence polymorphisms in the critical region, we generated a 129 kb 
reference sequence by assembling the sequence (genomic clones AC005680, AC005492, 
AC005174, AC005862.1, and AC005853.1) from the Human Genome Project (in the contig 
NT__000380 from the position 509,783 to the position 638,799). SNP discovery by re- 
sequencing was done in four selected patients and in three control individuals who were 
homozygous across the region and for the flanking markers. Repeat regions such as SINE, 
LINE, LTR, MERl and MER2 elements covered 60% of the sequence (a total of 129,017 
bp). PGR assays were designed to be -650 bp in length with 100 bp overlap with adjacent 
assays. Repeat regions were re-sequenced when it was possible to design primers in unique 
sequence within the range mentioned above. 

PGR assays were carried out in 20^1 volumes containing 20ng of genomic DNA, Ix PGR 
Buffer n, O.lmM dNTPs, 2.5 mM MgG12, 0.l^iM primer mix, and 0.5 U of DNA polymerase 
(AmpliTaqGold, Applied Biosystems). The samples were denatured for 10 min at 94X, 
followed by 35 cycles each of 30s at 94 G, 30 s at 58 G, and 30 s at 72 *^G. Elongation was 
performed for 10 min at 72X. Purified PGR fragments (Quickstep 2 PGR purification Kit, 
Edge BioSystems, MD) were sequenced in both directions using ABI Prism3100 sequencer 
and dye-terminator chemistry. We assembled the sequence reads using the Gap4 program. 

EXAMPLE IV 

SNP Genotvping 

SNP genotyping was done using two different methods: single base pair extension (SBE) or 
using altered restriction sites. SBE was done using the chemistry of Molecular Dynamics 
according to the suggestion made by the manufacturer on a Megabase 1000 sequencer 
(Molecular Dynamics, GA) using the primers listed in the Table 1 A. Allele caUing was 
performed by using the MegaBAGE SNP Profiler software (Molecular Dynamics). 

Twenty-nine of the SNPs were genotyped using different restriction enzyme digestions. All 
primers, restriction enzymes, and lengths of the digestion fragments of corresponding allele 
used in the genotyping are given in Table IB. If the SNP did not produce a natural site for 
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altered restriction, mutations were induced in PCR-primers (shown in capital letters in primer 
sequence). To improve allele calling by growing size difference between alleles, in some 
primers plasmid sequence tail of 20-30bp was added (shown in bold in the primer sequence). 
Altered restriction sites of the PGR products stained with ethidium bromide and visualized on 
agarose gels in UV light were called manually by two independent observers. All the markers 
were in Hardy- Weinberg equilibrium and observed Mendel errors, were less than 0.1%. 

EXAMPLE V 

Haplotvpe Association Analysis 
The haplotype analysis was done using Haplotype Pattern Mining (HPM) (Toivonen et al 
2000) program. For haplotyping, large pedigrees were divided into trios using an in-house 
computer program. The program identifies the maximum number of trios that are not 
overlapping and in which one or two members were affected (not both parents). Trios that 
included members who had not been genotyped or members with unknown phenotype were 
excluded. In phase three of hierarchical LD mapping, high density genotyping of the critical 
region was done in the trios informative for the association analysis (a total of 132 trios, 396 
study individuals, and revealing 304 unrelated affected and 220 control chromosomes). 
Haplotyping was done within each trio and four independent chromosomes were obtained 
from each trio. In case of ambiguities (missing genotype data, identical heterozygotic 
genotypes in all of the family members, or Mendel errors), the alleles were discarded. If the 
child was affected, the transmitted chromosomes were considered disease associated and the 
non-transmitted chromosomes as controls. If one of the parents was affected, his/her 
chromosomes were considered disease associated and the spouse's chromosomes as controls. 
If both the parent and the child were affected, only the non-transmitted chromosome of an 
unaffected parent was considered as the control and the other three as disease associated. 
These haplotypes were used as input for HPM. 

To estimate the overall significance of detected association, accounting for simultaneous 
testing of multiple markers, we used a second level permutation test by performing nested 
permutation tests described in detail elsewhere (Sevon et al 2001a and b). 



RESULTS 
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1. Haplotvpe Association 

Our genotyping effort searching for a shared haplotype among individuals who have high 
total IgE serum level was done in three phases using a hierarchical approach. Based on the 
initial observation on potential association, the marker map was made daoser in the regions of 
primary interest. In the haplo^e analysis we used the HPM algorithm that is known to be 
powerful in locating a disease-causing gene when one is known to exist (Toivonen et al. 
2000). By allowing gaps in the haplotype patterns, the algorithm is robust for genotyping 
errors, marker mutations, unrecognized recombinations and missing data. The data set 
consisted of 86 multiplex pedigrees that were included also in the previous linkage study and 
additional 103 nuclear families recruited solely for the association study. These families 
revealed 304 unrelated high IgiE associated and 220 control chromosomes which were then 
used as the input data for the HPM analysis. 

First we built a marker map across the whole linkage region with 80 microsatellite markers 
(Fig. 1). In the first phase of haplotype association analysis the marker density was 
approximately 220kb across the linkage region. The best associations (dii square >7.4) 
formed only one cluster of haplotypes located between the markers D7S690-NM13 
(~3.21cM). In the second phase we added 6 new microsatellites in between D7S690 and 
NM13. The associated haplotypes (chi square >8.9) clustered now at the centromeric end of 
the region, between markers G42099 and NM13 (301kb) (Fig. 2). HPM algorithm was used 
to evaluate the overall haplotype distribution among the high IgE associated conq)ared to that 
among control chromosomes. A permutation test incorporated to the HPM computer package 
was used to evaluate the statistical significance of our fmding by comparing the overall 
haplotype distribution among the high IgE associated and control chromosomes. Using chi 
square 7.0 as the threshold for association, 7 markers as the maximum length of the 
haplo^e, and allowing one gap in the haplotype patterns, the best associations were 
computed for three adjacent markers G42102, D7S497, and G4296 (marker-wise P values 
O.OS-0.06 based on 1 0 000 simulations). 

Fine mapping was then further contmued by adding 49 new markers between G42099 and 
NM13 giving an average marker density of 6kb. By using the HPM. algorithm, all associated 
haplotype (chi square >13) patterns spanned between SNP509783 and SNP638799 (129,017 
bp). 
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To study if the overall haplotype distribution in the locus differs in high IgE associated 
compared to that in control chromosomes, we used again a permutation test. HPM analysis 
was done using the following parameters: maximum pattern length 40 markers, one gap 
allowed for missing data and possible errors, and chi-square threshold for the association 
13.0. The observed scores (=number of qualified haplotype patterns spanning across the 
marker) for associated haplotypes varied from 0 to 40. Permutation test showed statistically 
biased haplotype distribution for high IgE. The marker-wise P values <0.005 were observed 
for all eleven of the markers between SNP531632 and SNP563930 and the marker wise P 
values <0.01 for all twenty-eight markers between NM51 and SNP617392 based on 10 000 
simulations. To estimate the effect of simultaneous testing of multiple markers, we used a 
second level permutation test, yielding a corrected P value of 0.03 (Sevon 2001a and b). 

2. Genetic Variation in the Critical Region 

Based on haplotype analysis we had determined the critical region in between the markers 
SNP509783 and SNP638799 (Fig. 2). SNP genotyping resulted in the identification of seven 
haplotypes H1-H7 that together explain 94.5% of the genetic variation detected at the AST-1 
locus. To identify all potentially causative genetic variations on the risk haplotype, we 
sequenced four patients who carried haplotypes H2/H2, H4/H4, H5/H5, and H7/H7, 
respectively, and were homozygous across the ASTl-locus region. The patients* sequences 
polymorphisms were compared with the reference sequence (SEQ ID N0:1) 
(http;//www.ncbi>nlm.nih>gov/entrez/auerv.fcri?cmd=Retrieve&db=nucIeotide&^^^ uid 
s-7382309&dQDtFGenBank . human genomic contig NT_000380) and to three population 
based controls homozygous for the same markers, but with different haplotypes Hl/Hl, 
H3/H3, and H6/H6, respectively. The differences are highlighted via SEQ ID N0:1. Genetic 
variation including small deletions and insertions (< 6 base pairs), variation in the length of 
microsatellite repeats, and multiple SNPs are presented in Table 3 that can be used as genetic 
markers of the risk haplotypes. Tables 13 and 14 show the SNPs unique for each haplotype 
and unique for certain haplotype combinations, respectively. In diagnostic testing these 
markers identify the haplotypes and the haplotype combinations of an individual without 
phase information. 



Example VI: Identification of the GPRA Gene 
A. Materials and Methods 



wo 2004/056866 PCT/FI2003/000973 

48 

1. Exon Prediction 

Exon predictions were performed by GENSCAN http://genes.imt.edu/GENSCAN.html) and 
software using genomic clones from AC005493,1 to AC005826.1 (690 kb) (Polvi et al 2002) 
as one block. Predictions were performed for both unmasked and masked sequence. In the 
latter, the repetitive sequences were masked using the Repeat Masker Service 
(http://repeatmasker.genome.washington.edu/). 

2. Reverse Transcriptase PGR 

The exon-specific primers were designed based on exon predictions (Table 4, Fig. 3). First 
strand cDNAs were synthesized from 1 |ig of commercially available poly A^ RNAs from 
human brain (Invitrogen), lung, testis, placenta, and thymus (BD Clontech) by the SMART 
RACE cDNA Amplification Kit (BD Clontech, Palo Alto, CA) according to manufacturer's 
instructions. Alternatively, Marathon-Ready cDNA (Clontech) or cDNA from Multiple 
Tissue cDNA Panels I and H (Clontech) were used as templates in some PGR amplifications. 
All the PGR amplifications were performed in 20-50 pi volumes using 2.0-5.0 \sX cDNA as 
template, 2 mM MgCh, 0.2 mM dNTPs (Fmnzymes), 0.2 mM of each primer, 0.15-0.4 U 
AmpliTaqGold, and Ix PGR Gold buffer (Applied Biosciences) under the following 
conditions: 94^C for 10 min, 35-40 cycles of 94 °C for 30 s, 58 °G -68 °C for 30 s, 72 ^C for 
1 min 40 s to 2 min followed by 72 ^G for 10 min. PGR products were analyzed on 1 % 
agarose gel and extracted from the gel using QIAquick Gel Extraction Kit (Qiagen). 

3. Rapid Amplification of cDNA Ends fRACE-PCR^ 

To generate 3* and 5' cDNA ends, rapid amplification of cDNA ends (SMART RAGE) was 
performed using human testis cDNA and Human Marathon-Ready Fetal Thymus cDNA 
(Clontech) according to the manufacturer's protocol for the SMART RAGE cDNA 
Amplification Kit (BD Clontech). RACE-PCR products were cloned using pGEM-T Easy 
Vector system (Promega) or TOPO TA Cloning kit (Invitrogen) and plasmid DNA was 
purified using QIAprep Spin Miniprep Kit (Qiagen). 

The purified RT-PGR products and the cloned RACE-PCR products were verified by 
automated sequencing with dye-termmator chemistry (ABI Prism3100, Applied Biosystems, 
Inc). 
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4. Cloning oftheFull>-LengthcDNAs of GPRA 

Nested PGR amplification was used in cloning of the full length cDNAs for GPRA splice 
variants A and B, The first round of PGR for variant A was done with the primer pair 
JEGEIFI and JEGE9aRl, and for variant B with JEGEIFI and JEGE9bRl (Table 4, Fig. 3). 
The following inner primers were used in the re-amplification: JEGE1F2 and JEGE9aR2 for 
variant A; JEGE1F2 and JEGE9bR2 for variant B, respectively. Full length C variant was 
amplified with the primer pair JEGE1F2 and JEGExRl. 

Primary PGR amplifications of variants A and B were performed in 25 nl volumes using 2.5 
|il Human Brain Marathon-Ready cDNA (Glontech) as ten^late, Ix DyNA2yme EXT buffer 
(contaming 1.5 mM MgGla), 0.2 mM dNTPs (Finnzymes), 0.52 \xM of each primer, 5% 
DMSO and 0.5 U DyNAzyme EXT polymerase (Finnzymes) under the following conditions: 
94^G for 4 min, 38 cycles of 94°G for 30 s, 65^G for 30 s, 72°G for 1 min followed by final 
extension of 72°C for 10 min. The aliquot of primary PGR product was re-amplified by 30 
PGR cycles under the same conditions as above. PGR amplification of variant C was 
performed in the above conditions. NGI-H358 cDNA was used as template. 

PGR products were cloned into pGR 2.1 TOPO -vector using TOPO TA cloning kit 
(Invitrogen) accordmg to manufacturer's instructions and plasmid DNA were purified using 
QIAprep Spin Miniprep Kit (Qiagen). The cloned RT-PGR products were verified by 
automated sequencing with dye-terminator chemistry (MegaBAGE 1000, Molecular 
Dynamics). 

5. Gell Culture and Isolation of Polv A"^ RNA 

In addition to commercially available poly A*** RNAs and Marathon-Ready cDNAs, human 
lung epithelial carcinoma cell line NGI-H358 (ATGG) was cultured as mRNA source for 
expression studies. Gells were cultured in RPMI 1640 medium (Gibco BRL) supplemented 
with 1 mM sodium puryvate (Gibco BRL), 10% FGS (Biological Industries), and 1% 
Penicillin/Streptomycin (GibcoBRL). Poly A"*" RNA was isolated by Dynabeads mRNA 
DIREGT Kit (Dynal) according to the manufacturer's instructions. 



6. Northern Blot Hybridization 
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The GPRA C specific (comprising the nucleotides firom 58 to 527 of the GPRA C variant) 
470 bp probe was generated by RT-PCR using human testis cDNA as template. The probe 
was radiolabelled with a["'^P]-dCTP using RediPrime Kit (Amersham Biosciences) according 
to the manufacturer's instructions. A Human Multiple Tissue 8-lane Northern blot (BD 
Clontech) was prehybridized in ExpressHyb solution (BD Clontech) for 1 h at 68 °C followed 
by hybridization with the specific probe for 1 h 15 min at 68 "C. Herring sperm DNA (100 
Hg/ml) was used as the blocking reagent. The filter was washed with 2 x SSC and 0.05 % 
SDS at room temperature and then exposed to X-ray film at -20 °C for 6 days. 



7. Western Blot and Immunohistochemistrv 
Two specific antibodies against the alternative carboxy terminals of GPRA were raised by 
immunizing rabbits with the following peptides: CREQRSQDSRMTFRERTER 
(corresponding to the nucleotides 1 148-1205 of the variant A) and 
CPQRENWKGTWPGVPSWALPR (corresponding to the nucleotides 1 196-1259 of the 
variant B of GPRA), GPRA A peptide synthesis and antibody production were purchased 
from Sigma-Genosys Ltd (London Road, Pampisford, Cambridge). GPRA B peptide 
synthesis was purchased from the University of Helsinki and antibody production from the 
University of Oulu. A total of 6 immunizations were performed at 2 weeks intervals using the 
total of 2 mg of KLH-conjugated peptide purified by gel filtration. GPRA antibodies were 
purified from whole serum by afOnity chromatography with the peptide coupled to iodoacetyl 
on a crosslinked agarose support according to the manufacturer's (Pierce, Meridian Road, 
Rockford IL, USA) instructions. 

For western blot analysis, human tissue lysates from spleen, skeletal muscle, uterine muscle, 
colon mixscle, kidney, colon epithelium, testes, and prostate were obtained by mechanically 
homogenizing the frozen tissue samples in 10 mM Tris HCl lOOmM NaCl 2 % Triton X-100 
buffer with proteinase inhibitors, 50 \xg of the protein lysates were run on reducing 12.5 % 
SDS-PAGE gels and electroblotted to the PVDF membrane according to standard procedures. 
Nonspecific protein binding was prevented by incubating the membrane with 5 % milk in 0.1 
% Tween 20 in TBS (TBST). Antigenic sites were revealed by incubating the membrane with 
the anti-GPRA C-terminal antibodies or pre-immune serum followed by the alkaline 
phosphatase-conjugated goat anti-rabbit secondary antibody (Jackson ImmunoResearch 
Laboratories Inc., West Baltimore Pike West Grove, PA, USA) in 5 % milk in 0.1 % TBST. 
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The color reaction was revealed by the NBT/BCIP method (Pierce). Negative controls did not 
show specific reactivity. 

Formalin fixed, paraffin-embedded specimens of normal adult human bronchus, skin and 
colon, and human normal tissue array slides (MaxArray, Zymed Laboratories Inc., CA) 
containing 30 different tissues were used for immunohistochemistry. For pre-treatment, slides 
were deparaffmized by xylene-treatment followed by decreasing alcohol series. The slides 
were heated in microwave oven in 10 mM citrate buffer, pH 6.0 for 5 minutes. 
Immunohistochemical analyses were performed using the ABC method (Vectastain Elite 
ABC kit, Vector Laboratories, Burlingame, California). Omission of primary antibody and 
staining with the nonimmunized sera was used as negative control for parallel sections. 
Neither of these controls showed any immunoreactivity , 

8. Transient transfections and Elisa 

Cos-1 cells were cultured in Dulbecco's modified Eagle's medium supplemented with 10% 
fetal calf serum, 1% PS and 5% non-essential amino acids. Cos-1 cells were transiently 
transfected with pCMV-myc- GPRA A, B, C, D, E, F or Bshort separately using Fugene6 
transfection reagent (Roche) following the manufacturer's protocol. After 24 hours, the 
transfected cells firom one well of 6 well plate were divided into 16 wells of 96 well plate. 
After 48 hours of transfection, the cells were fixed with 3.5 % PFA. Half of the cells were 
permeabilized with 0.5% TX-lOO for 10-15 min, blocked with TBS containmg 2% milk and 
1% goat normal serum at 37 "^C for 30 min, incubated with 1:1000 dilution of myc- specific 
primary antibody (Babco) for 1 h at 37 ""C, washed three times with TBS and thereafter 
incubated with a dilution of 1:2000 of HRP-conjugaded anti-mouse IgG antibody for 30 min 
at room temperature and washed three times witii TBS. TMB-subrate (Sigma) was added to 
cells and the reaction was let to proceed for 3 to 6 minutes after which the reaction was 
stopped by adding 1 .5 M HCL. Absorbance was measured at 450 Dm. 

9. Computational Protein Sequence Analvsis 
The transmembrane topology and putative N-glycosylation sites of GPRA protein were 
predicted using PredictProtein (littp://cubic.bioc.columbia.edu/ 
predictprotein/submit def html#top) and TMpred (http://www.ch.embnet.org/ 
software/TMPRED form.html^ softwares. In protein homology comparisons the BlastP 
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(http://www.ncbi.nlin.nih.gov/blast/) and T-Coffee 
(http://www.ch.embnet.org/software/TCoffee.html) softwares were used, 

10. Family Collection 

Genotyping of GPRA was done in a family collection recruited in central eastern Finland. 
The methods for recruitment, control for population stratification, and clinical evaluation 
have been described previously in detail (Kauppi et al 1998; Laitinen et al 1997). We used 
self-reported asthma as a sampling method. Total serum IgE level was determined by 
Diagnostics CAP FEIA (Kabi Pharmacia, Sweden) in one batch for all die participants. Based 
on total serum IgE level the study individuals were divided into two groups: high IgE 
responders (IgE > lOOkU/L) and low IgE responders (IgE < 100 kU/L). 

Altogether 253 families were recruited, two thirds of which were trios. Based on 
retrospective verification of the disease history and the results of diagnostic tests (spirometry, 
histamine or methacholine challenge test, expiratory peak flow measurements), 87% of the 
self-reported asthma patients were accepted as verified cases (Kauppi et al 1998). Criteria 
for asthma were based on the recommendations of the American Thoracic Society (Dantzker, 
D.R. et al. 1987). 86 large pedigrees were included into our previous genome scan (Laitinen 
et al 2001). For the association study, those pedigrees were divided into trios using a in- 
house computer program. Non-overlapping trios with fiiU phase and phenotype information 
and additional 103 trios were included to the association analysis without fiulher selection. 

1 1 . Screening for Exon Polymorp hisms 

Genomic DNA of four asthma patients and one healthy control fi:om above mentioned study 
group were screened for sequence variation with 12 pruner pairs covering all the verified 
exons of GPRA (Table 5). All the patients were homozygous for microsatellites G42099, 
G42100, D7S497, and G42097 (firom the NT_000380 position 476,213 to 644,425, a total of 
168 kb) and expressed the susceptibiUty haplotype (Patent 1). The control individual showed 
homozygosity for the same markers, but to a different, non-associating haplotype. 

PGR assays were carried out in 20 voliunes containing 20 ng of genomic DNA, O.lmM 
dNTPs (Finnzymes), 2.5 mM MgC12 (Applied Biosystems), 0.1 (oM primer mix, and 0.5 U of 
DNA polymerase, and Ix PGR Buffer H (AmpUTaqGold, Applied Biosystems). The samples 
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were denatured for 10 min at 94 ^C, followed by 35 cycles each of 30s at 94 °C, 30 s at 58 
°C, and 30 s at 72 °C. Elongation was performed for 10 min at 72 ''C. Purified PGR 
fragments (Quickstep 2 PGR purification Kit, Edge BioSystems) were sequenced from both 
directions using ABI Prini3100 (Applied Biosystems) sequencer and dye-terminator 
chemistry. We assembled forward and reverse sequence reads using the Gap4 program 
(Staden Package software). 

12. Genotvping 

Genotyping of five exonic SNPs in GPRA was done using two different methods. One marker 
was genotyped using single base pair extension (SBE) with the chemistry of Molecular 
Dynamics on a Megabase 1000 sequencer (Molecular Dynamics) according to the suggestion 
made by the manufacturer (Table 6A). Allele calling was performed by using the MegaBAGE 
SNP Profiler software (Molecular Dynamics). 

Four of the SNPs were genotyped using diJBferent restriction enzyme digestions. All primers, 
restriction enzymes, and lengths of the digestion firagments of corresponding allele used in 
the genotyping are given in Table 6B. If the SNP did not produce a natural site for altered 
restriction, mutations were induced in PGR-primers. To improve allele calling by growing 
size difiTerence between alleles, in one primer plasmid sequence tail was added. Altered 
restriction sites of the PGR products were visualized on ethidium bromide stained agarose 
gels in UV light and called manually by two independent observers. All the markers were in 
Hardy- Weinberg equilibrium and observed Mendel errors were less than 0.1%. 

13. Haplotvpe Association Analvsis 

The haplotype analysis was done using Haplotype Pattern Mining (HPM) (Toivonen et al 
2000) program. The data set provided a total of 132 informative trios, 396 study individixals, 
and revealing 304 unrelated affected and 220 control chromosomes. Haplotyping was done 
within each trio and four independent chromosomes were obtained from each trio. In case of 
ambiguities (missing genotype data, identical heterozygotic genotypes in all of the family 
members, or Mendel errors), the alleles were discarded. If the child was affected, the 
transmitted chromosomes were considered disease associated and the non-transmitted 
chromosomes as controls. If one of the parents was affected, his/her chromosomes were 
considered disease associated and the spouse's chromosomes as controls. If both the parent 
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and the child were affected, only the non-transmitted chromosome of an unaffected parent 
was considered as the control and the other three as disease associated. These haplotypes 
were used as input for HPM. 

RESULTS 

1 . Characterization GPRA at RNA And Protein Level 

A. Genomic Structure of Different Splice Variants of GPRA 

The fixlHength cDNA sequence of GPRA was assembled by using RT-PCR with primers 
designed for predicted exons and by using RACE-PCR to generate the 5' and 3' ends of the 
gene. The genomic location of the primers is shown in the Fig. 3 and sequence in the Table 4. 
Nested PGR amplification was used to produce full length cDNAs of different splice variants 
in one fragment. Using different (Marathon-Ready Brain cDNAs, RNA extracted form NCI- 
H358 cell line) template three splice variants were identified repeatedly (A, Biong, and C 
variants in the Fig. 3 and 4). All variants possess the same initiation site, but used alternative 
exons to encode the 3' end of the gene (exons 9A, 9B and 2B, respectively). The sequence 
flanking the putative ATG translation initiation site (GCCATGC) contains the -3 purine, but 
not the +4 guanine residue, of the Kozak consensus sequence. In two of the variants, the open 
reading frame was distributed across 9 exons producing cDNAs of 1 1 16 bp and 1 134 bp in 
size (variants A and Biong, Fig. 5 A and 5B) and encoding 371 and 377 amino-acid proteins, 
respectively. The C variant is a shorter transcript of GPJRA including only the exons El, E2a 
and E2b (Fig. 50). GPRA spanned 0.2 Mb of the genomic contig NT_000380. The second 
intron is the largest, comprising of 93.8 kb. 

When nested PGR specific for the B variant was applied in the cloning process a new splice 
variant was observed (Bshoit in Figs. 4B and 5B). A 33 bp deletion was observed in the 5' 
prime end of the exon 3. 

B. Expression Profiling of GPRA Using RT-PCR and Northern Blot Hvbridization 
RT-PCR analysis showed that different variants of GPRA were expressed in several human 
tissues. These tissues include testis, brain, pituitary gland, placenta, lung, heart, fetal thymus, 
and fetal heart. In addition to commercially available cDNAs, we analyzed the GPRA 
expression profile in NCI-H358 cell Une that represents broncho-epithelial origin. Variants A 
(JEG5F1 and Vau8R2), B (JEG5F1 and VaulOOORl) and C (JEG5F1 and JEGEXRl) 
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specific primer pairs were used in amplifications (Fig. 8, Table 4). All forms were caressed 
and using A variant specific primers, we found several transcripts. PGR fragments were 
cloned and sequenced. Sequence verification revealed variants A, D, E, and F (Figs. 4 and 5). 
Northern blot hybridization was done using a 470 bp cDNA probe comprising exons El, 
E2A, and E2B (Fig. 3). On Human 8-Lane MTN blot, the probe detected multiple transcripts 
sized approximately 6.5 kb, 6.0 kb, 1.8 kb, and 1.0 kb as shown for placental tissue in Fig. 9. 

C. The Predicted Structure oftheGPRA Protein 

Both TMpred and PredictProtein softwares predicted that the variants A and Biong encode a 
7TM protein with an extracellular N-terminus (approximately 50 amino acids) and 
intracellular C-terminus (Figs. 6A-6F). Their structure shared the common features of GPR 
family A with 16 conserved amino acids in the TM, loop, and C-terminal regions. These 
include among others two conserved Cys residues in exoloops 1 and 2 that potentially form a 
disulfide bridge; the Asp-Arg-Tyr sequence (DRY motif) in the proximity of TM3; Asn-Pro- 
X-X-Tyr motif in TM7; and a Cys residue in the C-terminal region (Figs. 6A-6F). The 
putative N-glycosylation site Asn"^ was predicted by PredictProtein. AU the other variants 
lacked the 7TM structure. For variant Bshort a 6TM structure was predicted (Fig. 6B). Variant 
C encodes 94 amino acid protem that has potentially only one TM region (Fig. 6C). For the 
variants D and E, deletions of the exons 3 or 4 caused a shift in the ORF, presumably 
producing truncated forms of the protein. When both exons were deleted (variant F), the ORF 
remained intact, but only five-transmembrane regions were predicted (Fig. 6F). 

D. Sequence Searches 

Determined by BlastP comparisons, the GPRA protein shows 31 % amino acid identity to 
human vasopressin receptor IB, 28 % identity to human vasopressin receptor 2, 32 % identity 
to human oxytocin receptor, and 43 % amino acid sequence identity to gene product CG61 1 1 
of Drosophila melanogaster which is considered as an orthology to human 
vasopressin/oxytocin receptor family (Park et al 2002). The best sequence identity between 
proteins was found in TM regions and in the first exoloop. Based on T Coffee homology 
comparisons, GPRA showed 20% amino acid identity to the bovine rhodopsm receptor. 
Sequence comparisons between GPRA and mouse ESTs BB638128, BB632343, BB625809, 
BB228269, and publicly available mouse genomic sequence suggest the existence of a mouse 
orthology. 
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2, Bxpression Profiling of GPRA Variantfi A. and B Usin p Western Blot Analysis and 
Immiinnhistocheniistrv 

Immimostaining of noma! adult human tissue samples with GPRA A variant specific 
antibody showed strongest expression in smooth muscle cells (SMC), such as SMC layer in 
bronchial and arterial waUs in human lung and colon shown in Fig. lOA and E. In alveolar 
wall and alveolar macrophages of lung (Fig. IOC) intense staining of GPRA A was detected. 
In colon epitheUum (Fig. lOD), strong basal stainmg was observed, whereas in the bronchial 
epithelium (Fig. lOA) and keratinocytes (Fig. lOF), only mild/weak immunostaining was 
detected. 



Western blot analysis with GPRA A variant specific antibody revealed four intensive bands 
corresponding to molecular weights of approximately 50, 44, 42 and 40 kDa (Fig. 12A). In 
skeletal muscle only 50 kDa band was detectable whereas uterine muscle, colon epitheUum 
and prostate showed similar expression patterns at 50, 44, and 40 kDa. The most intensive 
GPRA A expression was recorded in colon muscle with the major band at 42 kDa. Weak or 
nonexistent bands were found in spleen, kidney, and testis. 

Immunostaining with GPRA B variant specific antibody revealed most intense expression of 
the protein in the epithelium of several mucosal tissues including bronchus, small intestine, 
and colon as shown in Figs. 1 lA, D, E. Contrary to the A variant, the expression of the B 
variant was strongest in the apical surface of the epithelium (Figs. 1 lA, D, E, and F). 
Alveolar waUs and alveolar macrophages (Fig. 1 IC) showed strong and SMCs in several 
tissues mild immunostaining of the variant B. 

With GPRA B variant specific antibody, two sharp polypeptide bands corresponding to 
molecular weights of approxunately 39 and 25 kDa were revealed by Western blot analysis 
with the strongest expression in kidney (Fig. 12B). The specificity of the antibody was 
confirmed by the blocking experiment with lOx molar excess of the peptide used in 
immimization. 



Cos-1 cells were transiently transfected with pCMV-myc GPRA A, B, Bshort, C, D, E, or F 
in order to study translocation of different GPRA variants in vitro. After 48 hours, cells were 
fixed and studied by cell based ELISA assay using myc antibody. 71% of recombinant GPRA 
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A and 52% of recombinant GPRA B were translocated to the plasma membrane while all the 
other variants were located in cytoplasm (Table 15). 

3. Characterization Of GPRA as a Genetic Regulator of Asthma Related Traits 

A. Detection of GPRA Polymorphisms Associated to Asdima Related Traits 

To detect sequence variations in GPRA associated to asthma related traits, we sequenced its 
verified exons and exon-intron boundaries in four patients homozygous for the haplotype 
significantly associated with high serum total IgE level among the Finnish asthma &milies. 
Comparison between patients and one control individual from the same data set were made to 
the reference sequence (genomic contig NT_000380) in the public data base. 

A total of 14 SNPs were found distributed in seven exons of GPRA (Table 7). Four of the 
SNPs were predicted to cause a non-conservative amino acid change: Asn>Ile located in the 
furst extracellular loop, Ser>Arg in the third cytoloop, Ghi>Arg and Thr>Ile in the C termini, 
one in each alternative splice variant. Six of the SNPs were located in the 3' UTRs of the A, 
B, and C variants of the gene. 

B. Association Analvsis 

We have previously defmed a susceptibility haplotype that significantly associates with 
asthma related phenotypes (best marker-wise P value based on 10,000 permutations 0.001) 
(Fig. 3, gray area). The region covers the genomic region between GPRA exons 2b and 5. We 
replicated the association analysis with five SNPs genotyped in the exons of GPRA in the 
same data set (304 disease associated and 220 control chromosomes). Best haplotype 
associations of GPRA for high serum IgE level when one gap was allowed in the haplotype 
patterns are shown in Table 8. The best observed associations reached x2 values of S.7-10.3. 
Only one of markers (SNP591694) was part of the previously determined susceptibility 
haplotype and the same amino acid change, 107Ile, showed also significant association to 
high IgE level together with other GPRA SNPs. 

To study whether the overall haplotype distribution in GPRA differs in high IgE associated 
compared to that in control chromosomes, we used a permutation test. HPM analysis was 
done using the following parameters: maximum pattem length S markers, one gap allowed 
for missing data and possible errors, and chi-square threshold for the association >S.O. The 
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observed scores (=niimber of qualified haplotype patterns spanning across the marker) for 
associated haplotypes varied from 6 to 14. Permutation test showed statistically biased 
haplotype distribution for high IgE. Based on 10,000 simulations, the best marker-wise P 
value <0.01 was observed for a silent SNP (in position 640,764) in exon 5. P <0.02 was 
observed for three markers in the middle of the haplotype and P<0,04 for all the markers in 
the haplotype. 

Rxamplft yn 

This example describes a second gene occurring partially within the AST-1 locus and 
overlapping the GPRA gene. 

A. Materials and Methods 

1. Exon Prediction 

Exon predictions were performed by GENSCAN (http://genes.mit.edu/GENSCAN.html) 
software using genomic clone NT_000380 (Polvi et al. 2002). Predictions were performed for 
both unmasked and masked sequence. In the latter, the repetitive sequences were masked 
using the Repeat Masker Service (http://repeatmasker.genome.washington.edu/). 

2. Reverse Transcriptase PGR 

The exon-specific primers were designed based on exon predictions (Table9, Figurel3). First 
strand cDNAs were synthesized from 1 ^g of commercially available poly A* RNAs from 
human lung, testis, kidney and fetal liver (BD Clontech) by flie SMART RACE cDNA 
Amplification Kit (BD Clontech, Palo Alto, CA) according to manufectuier's instructions. 
Alternatively, Marathon-Ready cDNA (Clontech) or cDNA from Multiple Tissue cDNA 
Panels I and n (Clontech) were used as templates in some PGR amplifications. 

All the PGR amplifications ware performed in 25-50 ^1 volumes using 2.0-5.0 cDNA as 
template, 2 mM MgCU, 0.2 mM dNTPs (Finnzymes), 0.2 mM of each primer, 0.15-0.4 U 
AmpliTaqGold, and Ix PGR Gold buffer (Applied Biosciences) under the following 
conditions: 94''C for 5 min, 35-40 cycles of 94 °C for 30 s, 58 °C -62 °G for 40 s, 72 "C for 
1 min 30 s followed by 72 °C for 10 min. PGR products were analyzed on 1 % agarose gel 
and extracted from the gel using QIAquick Gel Extraction Kit (Qiagen). Purified PGR 
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products were analyzed by automated sequencing with dye-terminator chemistry (Megabase 
1000 sequencer^ Molecular Dynamics). 

3. Cell Culture and Isolation of Polv A"^ RNA 

In addition to commercially available poly RNAs, RNA from Ebstein Barr virus infected 
lymphoblast cell lines of the patients who were homozygous, heterozygous, or non-carriers of 
ASTl were cultured for expression studies. L3rmphoblasts were cultured in BPMI 1640 
medium (Gibco BRL) supplemented with 1 mM sodium pyruvate (Gibco BRL), 10% FCS 
(Biological Industries), and 1% Penicillin/Streptomycin (GibcoBRL), Poly A"^ RNA was 
isolated by Dynabeads mRNA DIRECT Kit (Dynal) according to the manufecturer's 
instructions. 

4. Northern Blot Hybridization 

The AAAl specific probe (mixture of the variants I, III, IV, VII, and X) was generated by 
RT-PCR using human lung, testis, kidney and fetal liver poly A^ RNAs as templates. The 
probe was radiolabelled with a[^^P]-dCTP using RediPrime Kit (Amersham Biosciences) 
according to the manufacturer's instructions. A Human Multiple Tissue 12-lane Northern 
blot, Fetal Multiple Tissue Northern blot, and Human Multiple Tissue Expression Array 2 
(BD Clontech) were prehybridized in ExpressHyb solution (BD Clontech) for 1 h at 68 °C 
followed by hybridization with the specific probe for 2-5 h at 68 °C. Herring sperm DNA 
(100 Hg/ml) was used as the blocking reagent. Filters were washed with 2 x SSC and O.OS % 
SDS at room temperature and exposed to X-ray film at -20 **C for 1 - 5 days. 

5. GenotvDing 

Genotyping of four SNPs that are located either in the coding region or near the exon-intron 
boundaries of AAAl was done 1) using altered restriction sites (SNP_538567 or 
SNP_574953, Table IB) or 2) single base pair extension (SBE) with the chemistry of 
Molecular Dynamics on a Megabase 1000 sequencer (Molecular Dynamics) according to the 
suggestion made by the manufacturer (Table 10). Allele calling was performed by using the 
MegaBACE SNP Profiler software (Molecular Dynamics, 

All the markers were in Hardy- Weinberg equilibrium and observed Mendel errors were less 
than 0.1%. 
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6. In vitro translation of the AAAl and characterization of AAAl antibody 
In order to investigate whether AAAl is translated to any polypeptide, in vitro translation 
experiments were performed (Figure 21). Capped RNAs of AAAl gene variants defined by SEQ 
ID NOS: 16 and 22 were transcribed firora DNA constructs with the aid of T7 RNA polymerases 
(mMESSAGE mMASCHINE system, Ambion, USA). Translation was performed with rabbit 
reticulocyte lysate translation machinery (Riboprobe in vitro translation system, Promega) in the 
presence of S^^ -labelled methionine in the reaction mixture. The Xenopus elongation factor a 
(pTRI-Xef) DNA template was used as a positive control for transcription and translation. In 
negative control, water was used instead of DNA. The translated polypeptides were detected by 
autoradiography after Tris-Tricine SDS-PAGE. 

B. Results 

1. Characterization of AAAl at RNA Level 

A. Genomic Structure of DiflFerent Splice Variants of AAAl 
The human AAAl gene spans 520 kb of the genomic contig NT_000380 (nucleotides 
163615--684776) and it is divided into 18 exons (Figure 13 and Table 11). The cDNA 
sequences of AAAl splice variants were assembled by using RT-PCR with primers designed 
for predicted exons. The genomic location of the primers is shown in the Figure 13 and the 
sequence in the Table 9. Using cDNAs from human lung, kidney, testis and fetal liver as 
templates, twelve splice variants were identified repeatedly (variants I-Xn in the Figure 14), 
All variants share exon 6, but use alternative exons to encode the 5' and 3' ends of the 
transcript. In six variants out of ten, the sequence flanking the putative ATG translation 
initiation site (GCCATGC) contains the -3 purine, but not the +4 guanine residue, of the 
Kozak consensus sequence. 

B. Expression Profiling of AAAl Using RT-PCR and Northern Blot Hvbridization 
Northern blot analysis of a multiple tissue expression array showed that AAAl is expressed 
in several human tissues (Figure 16). Strong expression was seen in testis, brain, placenta, 
lung, heart, skeletal muscle, kidney, liver, fetal liver, and fetal lung. In multiple tissue 
northern blots, two main transcripts (2.4 and 7.5 kb in size) were detected (Figure 17). 
Additionally, all sample lanes showed extensive label streaming which can be an indication 
of several low-abundant transcripts with different sizes. 



wo 2004/056866 



61 



PCT/FI2003/000973 



By RT-PCR exceptionally strong splicing and tissue specific differences in AAAl 
expression could be found (FigurelS), For example, expression of splice variants I and IV 
was particularly strong in the lungs consistent with the association between AAAl and 
asthma. 

In the in vitro translation experiment, the Xenopus elongation factor a (pTRI-Xef) DNA template 
used as a positive control resulted in the synthesis of a 50-kDa polypeptide as expected. However, 
neither of the two investigated constructs was translated mto polypeptides strongly arguing that 
AAAl functions ^s a non-coding RNA gene (Figure 21). 

To further investigate the translation of the AAAl gene, a polyclonal antibody against the constant 
region (YVRRNAGRQFSHC) of the gene product was produced in rabbits. AAAl peptide 
synthesis and antibody production were purchased firom Sigma-Genosys Ltd (London Road, 
Pampisford, Cambridge). To test the specificity of the antibodies, Glutathione S-transferase (GST) 
-fusion proteins for AAAl were produced with the pGEX 4T-3 GST fusion expression vector 
(Amersham Biosciences) according to the manufacturer's instructions. (Figure 21). The antibody 
displays high affmity against the the recombinant AAAl protein produced in bacterial lysate with 
no cross-reactivity between tiie GST construct alone. In spite of that, the antibody did not reveal 
any reactivity either in Western blots (spleen, skeletal muscle, uterine muscle, colon muscle, colon 
epithelium, kidney, testis and prostate) or in immunohistochemistry (bronchial tissue, HepG2 cell 
line) (data not shown). 

In addition, formalin fixed, paraffin-embedded specimens of normal adult human bronchus tissue 
and HepG2-cells (positive for AAAl RT-PCR) were used for immunohistochemistry. 
Immunohistochemical analyses were performed using rabbit AAAl antibody and the ABC method 
(Vectastain Elite ABC kit. Vector Laboratories, Burlingame, Califomia).Finally, AAAl does not 
appear to have a mouse counterpart. Concluding firom our data, AAAl may be a non-coding RNA 
gene and is unlikely to encode a functional protein. 

C. The Predicted Structure of the AAAl Peptides 
The cDNA sequences of twelve AAAl splice variants are shown in Fig. 20. All predicted 
AAAl proteins are small peptides (size firom 34 aa to 74 aa) that do not show significant 
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identity to any known modular structures or motifs. All isoforms contain the same core 
sequence (ayvrrnagrqfshcnlhahqflvrrkq) flanked by alternative amino- and carboxyterminal 
tails (Figure 15). 

2. Characterization Of AAAl as a Genetic Regulator of Asthma Related Traits 

A. Association Analysis 
ASTl covered the exons 3 to 10 of AAAl (Figure 13). For the association analysis we chose 
one exonic SNP (SNP SI 7278) and four additional SNPs near exon-intron boundaries of 
AAAl (Table 12). All the polymorphisms were identified previously (Table 3). The analysis 
for high total serum IgE level was done in the same data set as previously (304 disease 
associated and 220 control chromosomes, Table 2 and 8). Best haplotype associations of 
AAAl are shown in Table 12. The best observe associations reached the %1 values of 8.9- 
13.6 and the permutation test showed statistically biased haplotype distribution for high IgE. 
Based on 10,000 simulations, the best marker-wise P value <0.0001 was observed for 
SNP_5 17278 in exon 10. Corrected P value for testing multiple markers simultaneously was 
0.0002. 

B. Variable alternative splicing for AAAl depending on genotype 
To study further whether ASTl effects on the expression of AAAl we studied lymphoblast 
cell lines from the asthma patients who were homozygous, heterozygous, or non-carriers of 
ASTl. Using RT-PCR witii the primer pair SCFIO and ASKAR (Table 9) significant 
differences were found between patients and the differences were dependent on genotype 
(Figure 19). Only the non-carrier of ASTl processes normal amount of the exon 6-lOb 
transcript, whereas the homozygote and heterozygotes show either an absent transcript or 
smaller splice variants. Differences m expression patterns suggest that ASTl can effect 
splicing of the gene and thereby, increase the risk of asthma related diseases among ASTl 
carriers compared to that among ASTl non-carriers. 
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Although the foregoing invention has been described in some detail for purposes of clarity 
and understanding, it will be clear to one skilled in the art from a reading of this disclosure 
that various changes in form and detail can be made without departmg from the true scope of 
the invention. The above examples are provided to illustrate the invention, but not to limit its 
scope; other variants of the invention will be readily apparent to those of ordinary skill in the 
and are encompassed by the claims of tiie mvention. The scope of the invention should, 
therefore, be determined not with reference to the above description, but instead should be 
determined with reference to the appended claims along with their fiill scope of equivalents. 
Any embodiment, feature, step or element described above can be used in combination with 
any other unless otherwise apparent from the context. All publications, references, and patent 
documents cited in this application are incorporated by reference in their entirety for all 
purposes to the same extent as if each individual publication or patent document were so 
individually denoted. 
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Tables. Primer pairs used In re-sequecing of the exons and exon/intron bondaries of GPRA 



Primer 


Forward primer 


Reverse primer 




aClCagClgCEggagCEEg 


igacacicicaaguccagcagcc 


\JrJ\A CXza 


aggagga&gaaatccagcct 


tgaccgatgcgttacattti 


KXJrJx/i GaXu. 1 


gCCaiavauguagloaCClgaaa 


ccTCaiccagggaiauigc 


GPRA ex2b.2 


ttcctaccaacaagaactccaa 


cgatgatgaattagaacatacaacttt 




taagtcaaagaactcctaccttgc 


agcaaaggaaatacattaaaaatcaaa 


GPiM ex4 


ctgccctctttcacccagta 


agccacccaccttccttagt ^ 


GPBAexS 


gcttctgttcaagcttcccttt 


gtgtggtcctgtectgacg 


GPRA ex6 


tggatcctcatggtcactttc 


tctctgctggcatagcttga 


GPiL4 ex7 


ttgagagagtctgagcattcca 


ccaaattattcaacccatagcc 


GPRAexA 


tgacatcaatgctccaaacaa 


tgtcatgattaaggcggtttc 


GPRAexBa. 


ttaacatgtctacttgccttttca 


tctgcaaaccgaggctatct 


GPRA ex9b 


gcagagctgtcacccaaaat 


agcctgggcaacaagagtaa 
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Allele 2 




a 


*- 


u 












Product size* 


290+50 


240+215 


220+50 


140+260 










• 


Allele 1 




CO 


u 








ITable 6B. Primer pairs, rectrction enzymes, and con-esbonding allele sizes used in genotyping of GPRA exonic SNPs. 


r 




Product size* 


CO 


CO 
iO 


o 

CM 


o 
o 










Enzyme 


Sapl 


Mboll 


Ddel 


z 

a 

CO 






r in the name of the SNP shows its position in the genomic contig NT 000380. 




Reverse primer 


tagcacaatgcctgccctat 


gtcttgtgcatctcccaggt 


ctagcactggcactgcccta 


tctgcaaaccgaggctatct 








FoHA/ard primer 


aactgactaaactaggtgccacgtcgtccatacatgaccatcgtgctctt 


tgtctacctgttggcctgtg 


catggagaaggaaggtcagg 


atgactgcatgcactgctta 




r product sizes are approximitations 


Six digit numbe 




Marker 


SNP_640764 


N 
CO 

? 

z 

CO 


SNP__662803 


CO 

i 

z 

CO 
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Table 8, Best haplotype associations of GPR4 for high serum IgE level in the data set of 304 disease 
associated and 220 control chromosomes when one gap (marked with an asterisks) was allowed in the 
h^lotype patterns. 



High IgE Control 


Conf 






G P R A 


markers 




SNP591694 SNP640763 SNP662763 SNP662803 SNP663133 
Exon3 Exon5 . Exon9A £xon9A Exon9A 






0.761 


10.3 




L> 












• 

0.759 


8.5 


T 
1 




m 






40 

•TV 


19 


0.769 


8.5 






■ 


T 








0.765 


7.9 








T 


c 


39 


12 


0.765 


7.9 




c 


G 




•* 


39 


12 


0.765 


7.9 




c 


G 


T 




38 


12 


0.76 


7.3 




c 


G 




c 


38 


12 


0.76 


73 




c 


G 


T 


c 


35 


11 


0.761 


6.8 


T • 

* 


c 


* 


T 




34 


11 


0.756 


6.2 


T 


c 


* 


T 


c 


34 


11 


0.756 


6.2 


T 


c 


G 






34 


11 


0.756 


62 


T 


c 


G 


T 




33 


11 


0.75 


5.7 


T 


c 


G 


* 


c 


33 


11 


0.75 


5.7 


T 


c 


G 


T 


c 



High IgE, nmnber of disease-associated haplotypes with a specific pattern; Control, number of control 
haplotypes with a specific pattern; Confidence, percentage of haplotypes with the specific pattern that is 
associate with disease, chi-square value for disease association of the specific haplotype pattem. 
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Table 9, Primers used for cloning of full length cDNAs for AAAl. 



Primer 


Exon 


Sequence 


IF 


1 


5' AGAATGAGTCTCTGATGACrrr 3' 


nF 


2 


5'ACTTGCTGTTCATAGAATTGCAA 3' 


SF9 


4 


5'TGACriTCTCCCCAGATTTTTGTAT 3' 


exIF 


•5 


5' ACACATACAAAGTGCCTACCACAT 3' 


SR13 


9 


S'TTGAAACTGTATTTCCCATATTGC 3' . 


ASKAF/R 


10b 


5 ' AAATGCAATAAAAATGCGGAACTA 3 ' 


XR 


11 


5' GAGTCATTAGTCCAGAGAACAT 3' 


xm 


12 


5'CTGCTTGGAACAGTGTATATC 3' 


XUIR 


14 


5'TGGTCTACGTAGAATTCAGAGTA 3' 


XVR 


16 


5'CATGTGITAATTGTGTCTTCACT 3' 


SR4 


19 


5'GGGTGTCATTTACACGAACAATAA 3' 


SFIO 


6 


5XAGTTCAGTCACTGCAATCTrCAT 3' 
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Sis 

.2 B o 

S •§ & 





(0<V) 


1 iQiO* *00 lO* * *o*o 


locus 


(OO) 


• OO** lOOOOO lOOO 

1 


:kers of the ASTl 


(0<0 

60t6t«*d[K[S 
9001911] 

. (V<0) 


• 


• 


Clo) 


1 






13.548 
12^79 
12.5479 
12J318 
12.2929 
12.2929 
1Z2377 
12 2377 
11.4627 
11.4627 
11.3004 
11.2636 
10.9929 
10.3762 
10.2633 
9.0248 
8.957 




Control 


?ip:!::5F:F:??f:i::F:?:p:j::p:gi[:: 




High 
IgE 


59/187 (24%) 

49/190(21%) 

49/190(21%) 

117/130(47%) 

53/189(22%) 

53/189(22%) 

47/180(21%) 

47/180(21%) 

47/190 (20%) 

47/190 (20%) 

49/190(21%) 

103/136 (43%) 

50/187(21%) 

47/189(20%) 

47/190(20%) 

60/184(25%) 

53/189(22%) 
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Table 13. Hap(otype specific AST1 polymorphisms for diagnostic testing. These markers can be 
used for the haplotype identification of an individual without phase infomnatlon. 



Haplotype 


Type of 


rosmon in 


rostuon in 


AueiG preseni in uie 


Miieie present in 




poiyinorpnisin 




KIT nnn^Qn 


sposinc napioQrpe 


Oilier napioiypes 


HI 


SNP 


4631 


514413 


T 




HI 


SNP 


51975 


561757 


G 




HI 


SNP 


53922 


CM^rt ill 

563704 


IT 

T 




H2 


SNP 


5634 


515416 


A 


0 


H2 


SNP 


6739 


516521 


A 


G 


H2 


SNP 


7550 


517332 


A 


G 


H2 


deletion 


9199-9201 


518981-83 


deletion of TCT 


no deletion 


H2 


SNP 


16980 


526762 


d^ 

C 


T 


H2 


SNP 


17147 


526929 


A 


G 


H2 


SNP 


17435 


527217 


G 


A 


H2 


SNP 


19272 


529054 


T 


A 

A 


H2 


SNP 


19452 


529234 


A 


G 


H2 


SNP 


20309 


530091 


A 


G 


H2 


SNP 


20789 


530571 


T 


G 


H2 


SNP 


24869 


534651 


d^ 

C 


T 


H2 


SNP 


32976 


542758 


C 


T 


H2 


SNP 


34716 


544498 


c 


A 


H2 


deletion 


38850-52 


548632-34 


deletion of CTC 


no deletion 


H2 


SNP 


50955 


560737 


C 


G 


H2 


SNP 


51476 


561258 


G 


A • 


H2 


SNP 


52573 


^^^^^^ 

562355 


A 


G 


H2 


SNP 


55134 


564916 


A 


G 


H2 


SNP 


56856 


566638 


T 


C 


H2 


SNP 


65857 


575639 


G 


T 


H2 


SNP 


66526 


576308 


T 


C 


H2 


SNP 


66902 


576684 


G 


A 

A 


H2 


SNP 


72270 


Ji^/H f"^\ 

582052 


A 


G 


H2 


^■s & ft 

SNP 


111012 


620794 


T 


C 


H2 


SNP 


112037 


621819 


G 


T 


H2 


SNP 


115192 


624974 


G 


U 


H2 


Aft. 

SNP 


123770 


633552 


G 


A 


H2 


SNP 


123788 


633570 


C 


o 


H3 


SNP 


22869 


532651 


c 


T 


H4 


deletion 


7334-35 


tf* id i^ J J% id *4ft 

517116-17 


deletion of AT 


no deletion 


H4 


SNP 


16893 


526675 


C 


T 


ft • A 

H4 


& ft 

SNP 


17209 


526991 


A- 


C 


ft i A 

H4 


SNP 


26198 


535980 


err 




H4 


SNP 


32124 


541906 


A/C* 


c 


ft ft ^ 

H4 


SNP 


36551 


546333 


A 


G 


H4 


SNP 


116032 


625814 


A/G 


u 


H6 


SNP 


54199 


563981 


C 


T 


H6 


SNP 


57790 


567572 


c 


A 

A 


ft ft 

H6 


SNP 


64559 


574341 


T 


C 


H7 


SNP 


1 


509783 


T 


c 


H7 


SNP 


93 


509875 


G 


A 


H7 


SNP 


918 


510700 


A 


G 


H7 


SNP 


983 


510765 


T 


A 


H7 


SNP 


987 


510769 


C 


T 


H7 


SNP 


1542 


511324 


T 


C 


H7 


SNP 


1710 


BA A A nn 

511492 


G 


A 


H7 


SNP 


1818 


511600 


T 


c 


H7 


SNP 


1927 


511709 


T 


A 


H7 


SNP 


2254 


512036 


C 


T 


H7 


SNP 


4689 


514471 


G 


C 


H7 


SNP 


5442 


515224 


G 


c 


H7 


SNP 


18383 


528165 


A 


G 


H7 


SNP 


18978 


528760 


G 


A 


H7 


SNP 


19871 


529453 


A 


G 


H7 


SNP 


28866 


538648 


C 


G 


H7 


SNP 


52776 


562558 


C 


G 



* A rare polymorphism found only in H4 haplo^e 



wo 2004/056866 



86 



PCT/FI2003/000973 



Table 14. ASTI polymorphisms specific for the different haplotype comblrtatlons for diagnostic 
testing. These matHers can be used for the identification of certain haplotype combinations of an 
individual without phase information. 



ricipioiypQ 


Tuna nf 

1 ype or 


ruoiuviiii in 


ruoiuon In 


Allele pieoeiii in 




COiMuinBUwn 


poi ymorpnioni 


fipo in Mn*i 

OCVii nsj* 1 


KIT nnfiQfln 


uio spssinc iidpiU' 


111 uuioi fiapiu* 










typo curnifltiauuii 












A 


r» 


n4+nD 


olMr 


40007 




T 

I 


r> 
\s 




CMD 










n«»Ti3 


CKID 


•4074 0 


KOQAQA 


A 


W 




aeieuon 






QBlBuOn Of 1 V3 


no Qeieciori 




CKID 








1 




CKID 




30* 100 


T 
1 


w 




CKID 






A 

A 






QMD 


OOOOU 




A 


w 




CKID 

OINr 


30790 


940000 


A 

A 


/2 
S3 




CMP 
OINr 


04Ow^ 




p 






insenion 




oo^uoo~oy 


iDoQiuon or v^w 


no insernon 




QMD 


ooouo 


OOOOOD 


T 

1 






QMD 
OINr 




0D4/0Z 


A 


r> 
0 




QMD 
Onlr 


000 Do 


00040D 


1 




n4+nt3 


QMD 

oNr 


D7yiy 


5/ f fUl 




T 
1 


n4+no 


OINr 


7o101 


CQCOOO 

oooooo 




G 


l_l A J.LJC 

H4+no 


oNr 


o23o2 


4 A 

5921 14 


T 


C 


n4+no 


oNr 


1 1 2726 


62250 0 


A 


T 


n4+no 


OKID 

oINIP 


113944 


o2372o 


G 


T 


pW+no 


oNr 


4 4 A nA c 

114945 


624727 


G 


A 

A 


UA ^UIC 

n4+nb 


oNH 


4 4 C A C A 

1164o4 


o2o24o 


A 

A 


G 


n4Trio 


QMD 

OINr 




DO044y 


A 


/-» 
0 


n2+n4+n5 


oNP 


4012 


513794 


A 

A 


C 


LJO_i_LJ A 1 [ 1 f 

n2+n4+H5 


SNP 


4981 


514743 


G 


A 

A 


n2+n44-H5 


SNP 


5850 


515632 


G 


A 

A 


H2+n4+n5 


SNP 


6312 


516094 


T 


C 


H2+n4+H5 


SNP 


6392 


516174 


T 


G 


H2+H4+H5 


SNP 


6485 


51 6267 


A 


G 


H2+n4+n5 


OMR 

SNP 


6522 


516304 


G 


C 


n/+n4+no 


QMD 

OINr 


DD40 


010420 




A 

A 


LJOxUI/l ^LIC 


QMD 
OINr 


b7bU 


010542 


C 


T 


n^+n4+nt5 


□eietion 


0004 

Do21 


olbbOJ 


deletion of 1 


no deletion 


LJO^LJ>1 _t-LJC 


/4 At) An 

deiouon 


7240-43 


517022-25 


deletion of ACTT 


no deletion 


nz+n4+nb 


CMD 

oNP 


7277 


517059 


G 


C 


ni£+n4+no 


oNP 


730o 


5170oO 


T 


G 


LiOxLt/1 a.lJC 


QMD 

oNP 


7305 


m oT 

517087 


C 


G 


LJOxLJjIxLJC 

n^+n4+no 


aeieuon 


Tone Q 

7300-0 


51 70So-yu 


deletion or tgt 


no deletion 


n n*f ^ no 


QMD 
OINr 


f 4yD 


01 1 0 








QMD 
OINr 


o4yu 


m QO70 


1 






QMD 
OINr 


yo4y 


01 y^foi 


0 


1 




uoteuon 


y/oz-yr 00 


0 iyoo**-Of 


aeieiion or 0 1 0 1 


no oeieuon 




QMD 

OINr 


1 1 ODO 


OZ 1 04U 




A 

A 


n2+n4+nb+n/ 


oNP 


3877 


513659 


c 


G 


n2+n4-fn5+n7 


OK.ID 

oNP 


7229 


517011 


T 


C 


n2+n4+no+n7 


OMO 

oNP 


4 (\Q 4 Q 

109 10 


O2059o 


C 


T 


n2+n4+n5+n7 


SNP 


12581 


522363 


c 


G 


H2+H4+H5+H7 


K, 1 

SNP 


19360 


529142 


A 


G 


H2+H4+H5+H7 


SNP 


19774 


529556 


A 


C 


H2+H4+H5+H7 


SNP 


20038 


529820 


C 


T 


n^+n4+n5+n7 


OKID 


20395 


530177 


Q 


T 


H6+H7 


SNP 


22475 


532257 


C 


T 


H6+H7 


SNP 


22493 


532275 


G 


A 


H6+H7 


SNP 


22715 


532497 


A 


G 


H6+H7 


SNP 


22934 


532716 


T 


A 


H6+H7 


SNP 


24264 


534046 


T 


G 


H6+H7 


SNP 


26356 


536138 


T 


C 


H6+H7 


SNP 


26675 


536457 


G 


A 


H6+H7 


SNP 


28197 


537979 


A 


G 


H6+H7 


SNP 


28858 


538640 


C 


T 


H8+H7 


SNP 


31224 


541006 


A 


G 


H6+H7 


SNP 


32185 


541967 


C 


T 


He+H7 


deletion 


34909 1544691 


deletion of T 


no deletion 
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Table 14. (Continued) 



Haplotype 


Type of 


Position in 


Position in 


Allele present In 


Allele present 


combination 


polymorphism 


S£Q ID N0:1 


NT_000380 


the spesific hapio- 


in other hapto- 










type combination 


types 


H6+H7 


SNP 


37327 


547109 


T 

1 


Q 




SNP 


37415 


547197 


A 


G 


H8+H7 


SNP 


37685 


547467 


G 


A 


H6+H7 


SNP 


37959 


547741 


T 


c 


H6+H7 


SNP 


39343 


549125 


T 


G 


H6+H7 


SNP 


50493 


560275 


A 


6 


H6+H7 


SNP 


50835 


560617 


c 


A 


H6+H7 


CA-repeat 


51022-49 


560804-31 


(CA)8 


(CA)6TA(CA)7 


H6+H7 


SNP 


51536 


561318 


T 


C 


H6+H7 


SNP 


51884 


561666 


T 


G 


H6+H7 


SNP 


60604 


570386 


T 


A 


H0+H7 


SNP 


61165 


570947 


A 


G 


H6+H7 


SNP 


75115 


584897 


A 


G 


H6+H7 


SNP 


82922 


592704 


A 


T 


H6+H7 


SNP 


83552 


593334 


T 


c 


H6+H7 


SNP 


85271 


595053 


A 


G 


H6+H7 


SNP 


110989 


620771 


c 


T . 


H8+H7 


SNP 


112030 


621812 


c 


G 


H6+H7 


SNP 


113428 


623210 


Q 


C 


H6+H7 


SNP 


115628 


625410 


c 


T 

• 


H6+H7 


SNP 


116515 


626297 


A 


G 


H6+H7 


SNP 


116926 


626708 


C 


T 


H6+H7 


SNP 


117276 


627058 


A 


T 


H2+H6+H7 


SNP 


28770 


538552 


T 


c 


H2+H6+H7 


SNP 


31910 


541692 


A 


G 


H2+H6+H7 


SNP 


37931 


547713 


T 


c 


H2+H6+H7 


SNP 


39314 


549096 


A 


G 


H2+H6+H7 


SNP 


50197 


559979 


T 


G 


H2+H6+H7 


SNP 


50334 


560116 


A 


G 


H2+H6+H7 


SNP 


50632 


560414 


A 


G 


H2+H6+H7 


SNP 


51217 


560999 


A 


G 


H2+H6+H7 


SNP 


51861 


561643 


G 


c 


H4+H5+H6+H7 


SNP 


54148 


563930 


A 


G 


H4+H5+H6+H7 


SNP 


54641 


564423 


c 


G 


H4+H5^-H8+H7 


SNP 


54751 


564533 


c 


T 


H4+H5+H6+H7 


SNP 


60559 


570341 


G 


c 


H4+H5+H6+H7 


SNP 


66164 


575946 


T 


c 


H4+H5+H6+H7 


SNP 


67867 


577639 


A 


T 


H2+H7 


SNP 


16845 


626627 


C 


T 


H2+H7 


SNP 


20089 


529871 


A 


T 


H2+H4+H5+H6+H7 


SNP 


21650 


531632 


T 


C 


H2+H4+H5+H6+H7 


SNP 


36909 


546691 


A 


C 


H2+H4+H5+H6+H7 


SNP 


39927 


549709 


C 


T 


H2+H4+H5+H6+H7 


SNP 


45626 


555608 


T 


C 


H2+H4+H5+H6+H7 


SNP 


66171 


574953 


G 


A 


H2+H4+H5+H6+H7 


SNP 


112283 


622065 


T 


A 


H2+H4+H5+H6+H7 


SNP 


112859 


622641 


C 


A 
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1 . An isolated GPRA polypeptide comprising an amino acid sequence 
that has at least 90% sequence identity to an amino acid sequence selected from the group 
consisting of SEQ. ID NOS: 3, 5, 7, 9, 11, 13 and 15 over the entire length of the selected 
SEQ ID NO. when compared using the BLASTP algorithm with a wordlength (W) of 3, and 
the BLOSUM62 scoring matrix. 

2. The isolated GPRA polypeptide of claim 1 comprising an anuno acid 
sequence selected from the group consisting of B-Long (SEQ ID NO: 5), B-Short (SEQ ID 
N0:7) , C (SEQ ID N0:9), D (SEQ. ID N0:1 1), E (SEQ ID NO: 13) and F (SEQ ID NO: 15) 
provided that up to 34 amino acids can be substituted, deleted or inserted relative to the 
selected SEQ ID NO. 

3. The isolated GPRA polypeptide of claim 1 comprising the amino acid 
sequence selected from the group consisting of B-Long (SEQ ID NO: 5), B-Short (SEQ ID 
N0:7) , C (SEQ ID N0:9), D (SEQ. ID NO: 1 1), E (SEQ ID NO: 13) and F (SEQ ID NO: 15) 

4. An isolated GPRA polypeptide comprising at least 1 0 contiguous 
amino acids from amino acids 343-377 of B-long (SEQ ID N0:5). 

5. An isolated GPRA polypeptide comprising an amino acid sequence 
that has at least 80% sequence identity to an amino acid sequence selected from the group 
consisting of SEQ. ID NOS: 3, 5, 7, 9, 1 1, 13 and 15 over a sequence comparison window of 
at least 40 amino acids when compared using the BLASTP algorithm with a wordlength (W) 
of 3, and the BLOSIJM62 scoring matrix provided that the polypeptide includes a variant 
amino acid encoded by a variant form shown in Table 7. 

6. An isolated GPRA polypeptide of claim 1 comprising the amino acid 
sequence of SEQ ID NO: 3 provided that the sequence contains an amino acid substitution of 
Asn 107Ile, Arg241Ser, or Gln344Arg. 

7. An isolated GPRA polypeptide of claim 1 comprising the amino acid 
sequence of SEQ ID N0:5 provided that the sequence contain as an amino acid substitution 
of Asn instead of lie at codon position 107, Arg instead of iSer at codon position 241, and/or 
Thr instead of He at codon position 366. 
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8 . An isolated nucleic acid encoding the GPRA polypeptide of any of the 
preceding claims. 

9. The isolated nucleic acid of claim 8 that hybridizes under highly 
stringent conditions to any of SEQ ID NOS: 4, 6, 8, 10, 12, and 14 wherem the highly 
stringent conditions are 6 x NaCl/sodium citrate (SSC) at about 45 °C for a hybridization 
step, followed by a wash of 2 x SSC at 50 °C 

10. An isolated nucleic acid of claim 8 that hybridizes under highly 
stringent conditions to any of SEQ ID NOS: 1, 4, 6, 8, 10, 12, and 14 without hybridizing 
under the same highly stringent conditions to SEQ ID N0:2, wherein the highly stringent 
conditions are 6 x NaCl/sodium citrate (SSC) at about 45 X for a hybridization step, 
followed by a wash of 2 x SSC at 50 ^C. 

11. An isolated nucleic acid of claim 8 having a sequence that is at least 90 
% identical to a nucleic acid having a sequence selected from the group consisting of SEQ ID 
NO: 4, 6, 8, 10, 12, and 14 over the entire length of the selected SEQ ID NO when compared 
using the BLASTN algorithm with a wordlength (W) of 1 1 , M=5, and N= -4. 

12. An isolated nucleic acid of claim 8 having a sequence that is at least 80 
% identical to a nucleic acid having a sequence selected from the group consisting of SEQ ID 
NO: 1, 2, 4, 6, 8, 10, 12, and 14 over a sequence comparison window of at least 100 
nucleotides when compared using the BLASTN algoriflun wilh a wordlength (W) of 1 1, 
M=5, and N= -4 provided that the nucleic acid includes a polymorphic site occupied by a 
variant form as shown in Table 3 or Table 7. 

13. The isolated nucleic acid of claim 12, wherein the variant form is at a 
polymorphic site not designated by a * in Table 7. 

14. An isolated nucleic acid of claim 8 having a sequence that is at least 80 % 
identical to a nucleic acid having a sequence selected from tiie group consisting of SEQ ID 
NO: 1, 2, 4, 6, 8, 10, 12, and 14 over a sequence comparison window of at least 100 
nucleotides when compared usmg tiie BLASTN algorithm with a wordlength (W) of 1 1, 
M=5, and N= -4 provided fliat the nucleic acid includes a polymorphic site occupied by a 
reference form designated with a * in Table 7. 
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15. An isolated genomic DNA molecule or a minigene having at least one 
intronic sequence and encoding a GPRA polypeptide that has at least 80 % sequence identity 
to an amino acid sequence selected from the group consisting of SEQ. ID NOS: 3, 5, 7, 9, 1 1, 
13 and 15 over a region at least 40 amino acids in length when compared using the BLASTP 
algorithm with a wordlength (W) of 3, and the BLOSUM62 scoring matrix . 

1 6. The isolated genomic DNA molecule or minigene of claim 1 5, 
comprising at least exons III-IV of SEQ ID NO: 2. 

17. The isolated genomic DNA molecule or mmigene of claim 15, wherein 
the intronic sequence is from intron 2 or 3 of a GPRA gene. 

18. The isolated genomic DNA molecule or minigene of claim 1 5, wherein 
the GPRA polypeptide includes amino acids 343-377 of B-Long (SEQ ID N0:5) 

19. The isolated genomic DNA molecule or minigene of claim 15, wherein 
the polypeptide is SEQ. ID NOS: 3, 5, 7, 9, 1 1, 13 or 15. 

20. The isolated nucleic acid, genomic DNA molecule or minigene of any 
of clauns 8-19, linked to a second nucleic acid witii which it is not naturally associated. 

2 1 . The isolated nucleic acid, genomic DNA molecule or minigene of 
claim 20, wherein the second nucleic acid includes a heterologous promoter operably linked 
to a gene within the isolated nucleic acid. 

22. A vector comprismg the isolated nucleic acid, genomic molecule or 
minigene nucleic acid of any of claims 8-21. 

23 . A host cell comprising the vector of claim 22. 

24. An antibody that specifically binds to an epitope within amino acids 
343-377 of B-long (SEQ ID N0:5) or amino acids 332-366 of B-short (SEQ ID N0:7). 

25. A method of preventing or treating asthma, other IgE mediated disease, 
chronic obstructive pulmonary disease or cancer, comprising 

administering to a patient sufiFering from or at risk of asthma, other IgE 
mediated disease, chronic obstructive pulmonajy disease or cancer an effective amount of a 
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modulator of a GPRA polypeptide comprising an amino acid sequence selected from the 
group consisting of SEQ. ID NOS: 3, 5, 7, 9, 1 1, 13, and 15. 

26. The method of claim 25, further comprising administering an effective 
amount of a modulator of a AAAl polypeptide comprising an amino acid sequence selected 
from the group consisting of SEQ ID NOS: 17, 19, 21, 23, 25, 27, 29, 31, 33, 35, 37, 39 and 
41. 

27. The method of claim 25, wherein the modulator binds to the GPRA 

polypeptide. 

28. The method of claim 25, wherein the modulator inhibits expression of 
the GPRA polypeptide. 

29. The method of claim 25, wherein the modulator is a transcript of an 
AAAl gene or an inhibitor of expression of an AAAl gene. 

30. The method of claim 26, wherein the modulator of the AAAl 
polypeptide binds to the AAAl polypeptide. 

3 1 . The method of claim 26, wherein the modulator of the AAAl 
polypeptide inhibits expression of the AAAl polypeptide. 

32. A method of identifying a modulator of a GPRA polypeptide, 

comprising 

contacting a cell expressing a GPRA polypeptide with an agent; 

determining whether the agent modulates expression of the GPRA 
polypeptide and/or signal transduction through the GPRA polypeptide, wherein the GPRA 
polypeptide is defined by any of claims 1-7. 

33. The method of claim 32, wherein the cell further expresses an AAAl 

polypeptide. 

34. A method of determining risk of asthma, other IgE mediated disease, 
chronic obstructive pulmonary disease or cancer, comprising; 

determining whether the individual has a variant polymorphic form in 
a GPRA gene, wherein presence of the variant polymorphic form indicates risk of asthma, 
other IgE mediated disease, chronic obstructive pulmonary disease or cancer. 
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35. The method of claim 34, wherein the variant form occurs in a 
noncoding region of the GPRA gene 

36. The method of claim 34, wherein the variant form occurs in a coding 
region of the GPRA gene 

37. The method of claim 34, wherein the variant form occurs between 
introns 1 and 4 of the GPRA gene. 

38. The method of claim 34, wherein the determining comprises 
determining whether the individual has a variant form relative to SEQ ID NO: 1 (AST-1 
locus). 

39. The method of claim 34, wherein the determinmg comprises 
determining whether the individual has a variant form relative to any of SEQ ED NOS: 1, 3, 5, 
7, 9,11,13, and 15. 

40. The method of claim 34, wherein the variant form is a variant form 
shown in Table 3. 

41 . The method of claim 40, wherein the variant form is at a polymorphic 
site shown in Table 3. 

42. The method of claim 34, wherein the variant form is a variant form 
shown in Table 7. 

43 . The method of claim 34, wherein the variant form is a variant form at a 
polymorphic site not designated * in Table 7. 

44. The method of claim 34, wherein the determining comprising 
determining whether the individual has variant polymorphic forms relative to SEQ ID NO: 1 
at each of a plurality of polymorphic sites within the AST-1 locus, the presence of variant 
polymorphic forms at two or more of the plurality of polymorphic sites indicating increased 
risk of asthma, other IgE mediated disease, chronic obstructive puhnonary disease or cancer. 

45. The method of claim 34, further comprising determming whether the 
individual has a variant polymorphic form in an AAAl gene, wherein presence of the variant 
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polymorphic fonn indicates risk of asthma, other IgE-mediated disease, chronic obstructive 
pulmonary disease or cancer. 



46. The method of claim 45, wherein the variant polymorphic form occurs 
in the coding region of the AAAl gene. 

47. The mefliod of claim 34, further comprising amplifying at least part of 
SEQ ID N0:1 (AST-1) locus including the polymorphic site before the determining step, 

48. The method of claim 34, wherein the determining is performed by 
allele specific amplification, allele specific hybridization, single strand conformation 
polymorphism (SSCP), oligonucleotide ligation assay, single-base extension assay, or 
restriction fiagment length polymorphism (RFLP). 

49. A method for identifying a polymorphic site correlated with a disease 
selected fi:om the group consisting of asthma, other IgE-mediated disease, chronic obstructive 
puhnonary disease and cancer or susceptibility thereto, comprising; 

identifying a polymorphic site within a GPRA gene, 
determining whether a variant polymorphic form occupying the site is 
associated with the disease or susceptibility thereto. 

50. The method of claim 49, wherein the variant form occurs in a 
noncoding region of the GPRA gene 

5 1 . The method of claim 49, wherein the variant form occurs m a coding 
region of the GPRA gene 

52. The method of claim 49, wherein the variant form occurs between 
introns 2 and 4 of the GPRA gene. 

53 . The method of claim 49, wherein the determining comprises 
determining whether the individual has a variant form relative to SEQ ID NO: 1 (AST-1 
locus) 
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54. The method of claim 49, wherein the determining comprises 
detennining whether the individual has a variant form relative to any of SEQ ID NOS: 1, 3, 5, 
7, 9, 11, 13, and 15. 

55. The method of claim 49, wherein the detennining is performed by 
comparing the frequency of the variant polymorphic form in individuals with and without the 
disease. 

56. A primer or probe nucleic acid that hybridizes under highly stringent 
conditions to a segment of SEQ ID NO: 1 , 2 or 4 or a variant form thereof differing from SEQ 
ID NO: 1, 2 or 4 at a position shown in Table 3 or Table 7, wherein the segment includes or 
is immediately adjacent to a polymorphic site shown in Table 3 or Table 7. 

57. The primer or probe of claim 56, wherein the position is a position 
other than a position designated * in Table 7. 

58. The primer or probe of claim 56 that is perfectly complementary to a 
segment of SEQ ID NO: 1 , 2 or 4. 

59. The primer or probe of claim 56 that is perfectly complementary to a 
variant form of a segment of SEQ ED N0:1, 2 or 4 shown in Table 3 or Table 7. 

60. The prhner or probe of claim 56 that specifically hybridizes to the 
segment of SEQ. ID NO: 1, 2 or 4 without hybridizing to a corresponding segment of an 
allelic variant shown in Table 3 or Table 7. 

61. A primer of claim 56 for conducting a single-base extension reaction, 
whereby the primer is perfectly complementary to a segment that is immediately adjacent to 
but does not include the polymorphic site. 

62. A transgenic animal comprising a nucleic acid according to any one of 

claims 8-21. 

63. The transgenic animal of claim 62, fiirther comprising a nucleic acid 
encoding an AAAl polypeptide. 



64. A transgenic animal of claim 62 disposed to develop a characteristic of 
asthma, other IgE-mediated disease, chronic obstructive puhnonaiy disease or cancer in 
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which an endogenous a GPRA gene encoding a cognate form of a GPRA polypeptide defined 
by any of SEQ ID NOS: 3, 5, 7, 9, 1 1, 13 and 15 is functionally disrupted to prevent 
expression of a gene product. 

65 . The transgenic animal of claim 64 in which an endogenous AAAl 
gene encoding a cognate form of an AAAl polypeptide defined by an of SEQ ID NOS: 17, 
19, 21, 23, 25, 27, 29, 31, 33, 35, 37, 39 and 41 is functionally disrupted to prevent 
expression of a gene product of the AAAl gene. 

66. A kit for use in diagnosing or assessing predisposition to asthma, other 
IgE-mediated disease, chronic obstructive pulmonary disease or cancer, comprising; 

a container; and in the container: 

a compound, preferably labeled, capable of detecting a polymorphic 
form at a polymoiphic site in a susceptibility locus for asthma as defined by SEQ ID N0:1, 2 
or 4. 

67- The kit according to claim 66, wherein the polymorphic site occurs at 
a position shown in Table 3, Table 7, Table 12, Table 13 or Table 14. 



68. The kit according to claim 66, wherein the compound is a primer or 



probe. 



69. The kit according to claim 66, wherein said primer is the primer of 



claim 56. 



70. An isolated AAAl polypeptide comprising an amino acid sequence 
that has at least 80% sequence identity to an amino acid sequence selected from the group 
consisting of SEQ. ID NOS: 17, 19, 21, 23, 25, 27, 29, 31, 33, 35, 37, 39 and41 over the 
entire length of the selected SEQ ID No: when compared using the BLASTP algorithm with a 
wordlength (W) of 3, and the BLOSUM62 scoring matrix. 

7 1 . The isolated AAAl polypeptide of claim 70 comprising an amino acid 
sequence that has at least 90% sequence identity to the selected amino acid sequence. 

72. The isolated AAAl polypeptide of claim 70 comprising an amino acid 
sequence selected from the group consisting of SEQ ID NOS: 17, 19, 21, 23, 25, 27, 29, 31, 
33,35, 37, 39 and 41. 
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73. The isolated AAAl polypeptide of claim 70 comprising at least 10 
contiguous amino acids from an amino acid sequence selected from the group consisting of 
SEQ ID N0S:17, 19, 21, 23, 25, 27, 29, 31, 33, 35, 37, 39 and 41. 

74. An isolated nucleic acid encoding the AAAl polypeptide of any of 

claims 70-73. 

75. The isolated nucleic acid of claim 74 that hybridizes under highly 
stringent conditions to any of SEQ ID NOS: 16, 18, 20, 22, 24, 26, 28, 30, 32, 34, 36, 38 and 
40, wherein the highly stringent conditions are 6 x NaCl/sodium citrate (SSC) at about 45 ^'C 
for a hybridization step, followed by a wash of 2 x SSC at 50 

76. An isolated nucleic acid of claim 74 having a sequence that is at least 
80% identical to a nucleic acid having a sequence selected from the group consisting of SEQ 
ID NO: 16, 18, 20, 22, 24, 26, 28, 30, 32, 34, 36, 38 and 40 over the entire length of the 
selected SEQ ID NO when compared using the BLASTN algorithm with a wordlength (W) of 
ll,M=5,andN=-4. 

77. The isolated nucleic acid of claim 76, wherein at least one polymorphic 
site shown in Table 12 is occupied by a variant nucleotide. 

78. An isolated nucleic acid having at least 20 contiguous nucleotides from 
a sequence selected from the group consisting of SEQ ID NOS: 16, 18, 20, 22, 24, 26, 28, 30, 
32, 34, 36, 38 and 40. 

79. An isolated genomic DNA molecule or a minigene having at least one 
intronic sequence and encoding an AAAl polypeptide that has at least at least 80 % sequence 
identity to an amino acid sequence selected from the group consisting of SEQ. ID NOS: 17, 
19, 21, 23, 25, 27, 29, 31, 33, 35, 37, 39 and 41 over tiie entire length of the selected SEQ ID 
NO. when compared using the BLASTP algorithm with a wordlength (W) of 3, and the 
BLOSUM62 scoring matrix . 

80. The isolated genomic DNA molecule or minigene of claim 79, wherein 
the polypeptide has a sequence seleted from the group consisting of SEQ. ID NOS: 17, 19, 
21, 23, 25, 27, 29, 31, 33, 35, 37, 39 and 41. 
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8 1 . The isolated nucleic acid, genomic DNA molecule or minigene of 
claim 79 or 80 linked to a second nucleic acid with which it is not naturally associated. 

82. The isolated nucleic acid, genomic DNA molecule or minigene of 
claim 79, wherein the second nucleic acid includes a heterologous promoter operably linked 
to a gene within the isolated nucleic acid. 

83. A vector comprising the isolated nucleic acid, genomic molecule or 
minigene nucleic acid of any of claims 74-82. 

84. A host cell comprising the vector of claim 83 . 

85. An antibody that specifically binds to a polypeptide selected fi-om the 
group consisting of SEQ ID N0S:17, 19, 21, 23, 25, 27, 29, 31, 33, 35, 37, 39 and 41. 

86. A method of preventing or treating asthma, other IgE-mediated 
disease, chronic obstructive pulmonary disease or cancer, comprising 

administering to a patient suffering from or at risk of asthma, other IgE- 
mediated disease, chronic obstructive puhnonary disease or cancer an effective amount of a 
modulator of an AAAl polypeptide comprising an amino acid sequence selected from the 
group consisting of SEQ. ID NOS: 17, 19, 21, 23, 25, 27, 29, 31, 33, 35, 37, 39 and 41. 

87. The method of claim 86, wherein the modulator binds to the AAAl 

polypeptide. 

88. The method of claim 86, wherein the modulator inhibits or induces 
expression of the AAAl polypeptide. 

89. A method of identifying a modulator of an AAAl polypeptide, 

comprising 

contacting an AAAl polypeptide with an agent; 

determining whether the agent binds to the AAl polypeptide, 
modulates expression of the AAAl polypeptide or modulates activity of the AAAl 
polypeptide, wherein the AAAl polypeptide comprises an amino acid sequenced as defined 
by any of SEQ ID NOS: 17, 19, 21, 23, 25, 27, 29, 31, 33, 35, 37, 39 and 41. 
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90. The mefliod of claim 89, wherein the AAAl polypeptide is expressed 

from a cell. 

91. A method of determining risk of asthma, other IgE mediated disease, 
chronic obstructive pulmonary disease or cancer, comprising; 

determining whether the individual has a variant polymorphic form in 
an AAAl gene, wherein presence of the variant polymorphic form indicates risk of asthma, 
other IgE mediated disease, chronic obstructive pulmonary disease or cancer. 

92. The method of claim 91 , wherein the variant form occurs in a 
noncoding region of the AAAl gene 

93 . The method of claim 9 1 , wherein the variant form occurs in a coding 
region of the AAAl gene 

94. The method of claim 9 1 , wherein the determining comprises 
determining whether the individual has a variant form relative to SEQ ID NO: 1 (AST-1 
locus) 

95. The method of claim 91, wherem the determining comprises 
determining whether the individual has a variant form relative to any of SEQ ID NOS: 16, 18, 
20, 22, 24, 26, 28, 30, 32, 34, 36, 38 and 40. 

96. The method of claim 91, wherein the variant form is a variant form 
shown in Table 12. 

97. The method of claim 9 1 , wherein the determining is performed by 
allele specific amplification, allele specific hybridization, single strand conformation 
polymorphism (SSCP), oligonucleotide ligation assay, single-base extension assay, or 
restriction firagment length polymorphism (RFLP). 

98. A method for identifying a polymorphic site correlated with a disease 
seleceted from the group consisting of asthma, other IgE mediated disease, chronic 
obstructive pulmonary disease or cancer or susceptibility thereto, comprising; 

identifying a polymorphic site within an AAAl gene, 
determining whether a variant polymorphic form occupying the site is 
associated with the disease or susceptibility thereto. 
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99. The method of claim 98, wherein the variant form occurs in a 
noncoding region of the AAAl gene. 

1 00. The method of claim 98, wherein the variant form occurs in a coding 
region of the AAAl gene 

101. The method of claim 98, wherein the determining comprises 
determining whether the individual has a variant form relative to SEQ ID NO: 1 (AST-1 
locus) 

1 02. The method of claim 98, wherein the determining comprises 
determining whether the individual has a variant form relative to any of SEQ ID NOS: 16, 18, 
20, 22, 24, 26, 28, 30, 32, 34, 36, 38 and 40. 

103. The method of claim 98, wherein the determining is performed by 
comparing the frequency of the variant polymorphic form in individuals with and without the 
disease. 

104. A primer or probe nucleic acid of nucleotides that hybridizes under 
highly stringent conditions to a segment of SEQ ID NO: 16, 18, 20, 22, 24, 26, 28, 30, 32, 34, 
36, 38 and 40 or a variant form thereof differing from SEQ ID N0S.16, 18, 20, 22, 24, 26, 
28, 30, 32, 34, 36, 38 and 40 at a single polymorphic site. 

105. The primer of probe of claim 104, wherein the polymorphic site is one 
shown in Table 3 or 12. 

106. The primer or probe of claim 104 that is perfectly complementary to a 
segment of SEQ ID N0S:16, 18, 20, 22, 24, 26, 28, 30, 32, 34, 36, 38 and 40. 

107. A primer or probe of claim 104 for conducting a single-base extension 
reaction, whereby the primer is perfectly complementary to a segment that is immediately 
adjacent to but does not include the polymorphic site. 

108. A transgenic animal comprising a nucleic acid according to any one of 

claims 74-82. 

109. A transgenic animal of claim 108 disposed to develop a characteristic 
of asthma, other Ig^-mediated disease, chronic obstructive puhnonary disease or cancer in 
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which an endogenous a AAAl gene encoding a cognate form of an AAAl polypeptid 
defined by any of SEQ ID NOS: 17, 19, 21, 23, 25. 27, 29, 31, 33, 35, 37, 39 and 41 i 
functionally disrupted to prevent expression of a gene product. 



e 

IS 



1 10. A pharmaceutical composition conaprising an effective amount of a 
GPRA polypeptide comprising an amino acid sequence selected from the group consisting of 
SEQ. ID NOS: 3, 5, 7, 9, 1 1, 13, and 15 or a modulator thereof for the prevention or 
treatment of asthma or other IgE mediated disease, chronic obstructive puhnonaiy disease or 
cancer. 



1 1 1 . The pharmaceutical composition of claim 1 10, further comprising an 
effective amount of an AAAl polypeptide comprising an amino acid sequence selected from 
the group consisting of SEQ ID NOS: 17, 19, 21, 23, 25, 27, 29. 31, 33. 35. 37, 39, and41 or 
a modulator thereof 

1 12. The pharmaceutical composition of claim 1 10. wherein the modulator 
binds to the GPRA polypeptide and/or activates/inhibits expression of the GPRA polypeptide. 

1 13. The pharmaceutical composition of claim 111. wherein the modulator is a 
transcript of an AAAl gene or an activator/inhibitor of expression of an AAAl gene. 

1 14. The pharmaceutical composition of claim 1 1 1, wherein the modulator 
binds to the AAAl polypeptide and/or activates/inhibits expression of the AAAl polypeptide. 

1 15. Use of a GPRA polypeptide comprising an amino acid sequence selected 
from the group consisting of SEQ. ID NOS: 3, 5, 7, 9. 1 1. and 13 or a modulator thereof for 
the manufecture of a medicament for the prevention or treatment of asthma or other IgE 
mediated disease, chronic obstructive puhnonaiy disease or cancer. 

1 16. The use of claim 1 15, wherein a GPRA polypeptide comprising an amino 
acid sequence selected from the group consisting of SEQ. ID NOS: 3, 5, 7, 9, 1 1. 13, and 15 
or a modulator thereof and an AAAl polypeptide comprising an amino acid sequence 
selected from the group consisting of SEQ E> NOS: 17. 19, 21, 23, 25. 27, 29, 31. 33, 35, 37, 
39, and 41 or a modulator thereof are used for the manufecture of said medicament. 
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1 17. A pharmaceutical composition comprising an effective amount of an 
AAAl polypeptide comprising an amino acid sequence selected &om the group consisting of 
SEQ ID NOS: 17, 19, 21, 23, 25, 27, 29, 3 1, 33, 35, 37, 39, and 41 or a modulator thereof for 
the prevention or treatment of asthma or other Ig£ mediated disease, chronic obstructive 
pulmonary disease or cancer. 

118. The pharmaceutical composition of claim 1 17, wherein the modulator 
binds to the AAAl polypeptide and/or activates/inhibits expression of the AAAl polypeptide. 

1 19. Use of an AAAl polypeptide comprising an amino acid sequence 
selected from the group consisting of SEQ ID NOS: 17, 19, 21, 23, 25, 27, 29, 3 1, 33, 35, 37, 
39, and 41 or a modulator thereof for the manufacture of a medicament for the prevention or 
treatment of asthma or other Ig£ mediated disease, chronic obstructive pulmonary disease or 
cancer. 
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Figure 4A 

A SEQ ID NOs: 2 and 3 (6B AY310326) 

gggctcag 

ggagggctctgtgcctccgttcagcagagctgcagctgctgcccagctctcaggaggcaa 
gctggactccctcactcagctgcaggagcaaggacagtgaggctcaaccccgcctgagcc 
atgccagccaacttcacagagggcagcttcgattccagtgggaccgggcagacgctggat 

MPANFTEGSPDSSGTGQTLD 
tcttccccagtggcttgcactgaaacagtgacttttactgaagtggtggaaggaaaggaa 

SSPVACTETVTFTEVVBGKB 
tggggttcct tctactactcctttaag | actgagcaattgataactctgtgggtcctcttt 

WGSPYYSFKTEQLITIiWVLP 

gtttttaccattgttggaaactccgttgtgcttttttccacatggaggagaaagaagaag 

VFTIVGNSVVLPSTWRRKKK 

tcaagaatgaccttctttgtgactcagctggccatcacag | attctttcacaggactggtc 

SRMTFFVTQLAITDSFTGLV 
aacatcttgacagatattaattggcgattcactggagacttcacggcacctgacctggtt 

NILTDINWRPTG DPTAPDLV 
tgccgagtggtccgctatttgcag | gttgtgctgctctacgcctctacctacgtcctggtg 

CRVVRYLQVVLLYASTYVLV 
tccctcagcatagacagataccatgccatcgtctaccccatgaagttccttcaaggaglaa 

SLSIDRYHAIVYPMKFLQGE 
aagcaagccagggtcctcattgtgatcgcctggagcctgtcttttctgttctccattccc 

KQARVLIVIAWSIiSFLPSIP 
accctgatcatatttgggaagaggacactgtccaacggtgaagtgcagtgctgggccctg 

TLIIFGKRTLSNGEVQCWAL 
tggcctgacgactcctactggaccccatacatgaccatcgtggccttcctggtgtacttc 

WPDDSYWTPYMTIVAFLVYF 
atccctctgacaatcatcag | catcatgtatggcattgtgatccgaactatttggattaaa 

IPLTIISIMYGIVIRTIWIK 

agcaaaacctacgaaacagtgatttccaactgctcag | atgggaaactgtgcagcagctat 

SKTYETVISNCSDGKLCSSY 
aaccgaggactcatctcaaaggcaaaaatcaaggctatcaagtatagcatcatcatcatt 

NRGLISKAKIKAIKYSIIII 
cttg I ccttcatctgctgttggagtccatacttcctgtttgacattttggacaatttcaac 

liAPICCWSPYFLFDILDNFN 
ctccttccagacacccaggagcgtttctatgcctctgtgatcattcagaacctgccagca 

LIiPDTQERFYASVI IQNLPA 

ttgaatagtgccatcaaccccctcatctactgtgtcttcagcagctccatctctttcccc 
LNSAINPLIYCVPSSSISFP 

tgcag | ggagcaaagatcacaggattccagaatgacgttccgggagagaactgagaggcat 
CREQRSQDSRMTFRERTERH 

gagatgcagattctgtccaagccagaattcatctagaccctagggcagtgccagtgctag 
EMQILSKPEFI* 

gctgagcaccatcagctctcccaggtccttgtcacctgcttgggcacgtgcatggaaccc 
gagccaacttcaccccaccctcgtcattacctgggagatgcacaagacaaatgttctaat 
gactgcatgcactgcttaagtattggccaacacgaactccccagttattcatgccagcca 
ggaaggaaacgccttccttccccaccattcccagccctccttcccactggccagcacctg 
aacccagtgaacacaggcattagtggtccagggtcctggcttggagccagtgagtagac 
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Figure 4B1 

B long SEQ ID NOS: 4 and 5 (GB AY310327) 

gggctcag 

ggagggctctgtgcctccgttcagcagagctgcagctgctgcccagctctcaggaggcaa 
gctggactccctcactcagctgcaggagcaaggacagtgaggctcaaccccgcctgagcc 
atgccagccaacttcacagagggcagcttcgattccagtgggaccgggcagacgctggat 

MPANFTEGSFDSSGTGQTLD 
tcttccccagtggcttgcactgaaacagtgacttttactgaagtggtggaaggaaaggaa 

SSPVACTETVTFTEVVEGKE 
tggggttcct tctactactcctttaag | actgagcaattgataactctgtgggtcctcttt 

WGSFYYSPKTEQLITLWVIiF 
gtttttaccattgttggaaactccgttgtgcttttttccacatggaggagaaagaagaag 

VFTIVGNSVVLPSTWRRKKK 

tcaagaatgaccttctttgtgactcagctggccatcacag | attctttcacaggactggtc 
SRMTPFVTQLAITDSFTQLV 

aacatcttgacagatattaattggcgattcactggagacttcacggcacctgacctggtt 
NILTDINWRFTGDPTAPDLV 

tgccgagtggtccgctatttgcag | gttgtgctgctctacgcctctacctacgtcctggtg 

CRVVRYLQVVLLYASTYVLV 
tccctcagcatagacagataccatgccatcgtc taccccatgaagttcctt caaggag | aa 

SLSIDRYHAIVYPMKFLQGE 
aagcaagccagggtcctcattgtgatcgcctggagcctgtcttttctgttctccattccc 

KQARVLIVIAWSLSFLFSIP 
accctgatcatatttgggaagaggacactgtccaacggtgaagtgcagtgctgggccctg 

TLIIPGKRTLSNGEVQCWAL 
tggcctgacgactcctactggaccccatacatgaccatcgtggccttcctggtgtacttc 

WPDDSYWTPYMTIVAFLVYF 
atccctctgacaatcatcag | catcatgtatggcattgtgatccgaactatttggattaaa 

IPLTIISIMYGIVIRTIWIK 
agcaaaacctacgaaacagtgatttccaactgctcag | atgggaaactgtgcagcagctat 

SKTYETVISNCSDGKLCSSY 
aaccgaggactcatctcaaaggcaaaaatcaaggctatcaagtatagcatcatcatcatt 

NRGLISKAKIKAIKYSIIII 
cttg I ccttcatctgctgttggagtccatactt cctgtttgacattttggacaatttcaac 

IiAFICCWSPYFLFDILDNFN 
ctccttccagacacccaggagcgtttctatgcctctgtgatcattcagaacctgccagca 

LLPDTQBRPYASVI IQNLPA 

ttgaatagtgccatcaaccccctcatctactgtgtcttcagcagctccatctctttcccc 

LN SAINPLIYCVFSSS ISPP 
tgcag | ggtcatccgtctccgtcagctccaggaggctgcgctaatgctctgccctcaacga 

CRVIRLRQLQEAALMLCPQR 

gagaactggaagggtacttggccaggtgtaccttcctgggctcttccaaggtgacagctc 
ENWKGTWPGVPSWALPR* 

tcaccctgtgctgcaggtggccctgtgcctggtgccacttctcactgcttaccagggcac 

aaggacaccagtggttcccaaaatgggtcacagcaggatggcctgcatcagattcaccag 
ggagggctataagaaggcagac 
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Figure 4B2 

B short SEQ ID NOS: 6 and 7 (6B AY310328) 

gggctcag 

ggagggctctgtgcctccgttcagcagagctgcagctgctgcccagctctcaggaggcaa 
gctggactccctcactcagctgcaggagcaaggacagtgaggctcaaccccgcctgagcc 
atgccagccaacttcacagagggcagcttcgattccagtgggaccgggcagacgctggat 
MPANFTEGSFDSSGTGQTLD 

tcttccccagtggcttgcactgaaacagtgacttttactgaagtggtggaaggaaaggaa 
SSPVACTETVTFTEVVEGKE 

tggggttccttctactactcctttaag | actgagcaattgataactctgtgggtcctcttt 
WGSFYYSPKTEQLITLWVIiP 

gtttttaccattgttggaaactccgttgtgcttttttccacatggaggagaaagaagaag 
VPTIVGNSVVLFSTWRRKKK 

tcaagaatgaccttctttgtgactcagctggccatcacagjatattaattggcgattcact 

SRMTPFVTQLAITDINWRFT 
ggagacttcacggcacctgacctggtttgccgagtggtccgctatttgcag | gttgtgctg 

GDFTAPD LVCRVVRYLQVVIi 
ctctacgcctctacctacgtcctggtgtccctcagcatagacagataccatgccatcgtc 

LYASTYVLVSLS IDRYHAIV 
taccccatgaagttccttcaaggag | aaaagcaagccagggtcctcattgtgatcgcctgg 

YPMKFLQGEKQARVLIVIAW 
agcctgtcttttctgttctccattcccaccctgatcatatttgggaagaggacactgtcc 

SLSFIiPSIPTLIIPGKRTLS 
aacggtgaagtgcagtgctgggccctgtggcctgacgactcctactggaccccatacatg 

NQEVQCWALWPDDSYWTPYM 
accatcgtggccttcctggtgtacttcatccctctgacaatcatcag | catcatgtatggc 

TIVA.FLVYFIPLTIISIMYG 
attgtgatccgaactatttggattaaaagcaaaacctacgaaacagtgatttccaactgc 

IVIRTIWIKSKTYETVISNC 
tcag I atgggaaactgtgcagcagctataaccgaggactcatctcaaaggcaaaaatcaag 

SDGKLCSSYNRGLISKAKIK 

gctatcaagtatagcatcatcatcattcttg | ccttcatctgctgttggagtccatacttc 
AIKYSIIIIXiAPICCWSPYF 

ctgtttgacattttggacaatttcaacctccttccagacacccaggagcgtttctatgcc 
LFDIIiDNFNLLPDTQERPYA 

tctgtgatcattcagaacctgccagcattgaatagtgccatcaaccccctcatctactgt 

SVIIQNLPALNSAINPLIYC 

gtcttcagcagctccatctctttcccctgcag | ggtcatccgtctccgtcagctccaggag 
VFSSSISFPCRVIRLRQLQE 

gctgcgctaatgctctgccctcaacgagagaactggaagggtacttggccaggtgtacct 

AALMLCPQRENWKGTWPGVP 

tcctgggctcttccaaggtgacagctctcaccctgtgctgcaggtggccctgtgcctggt 
S W A L P R * 

gccacttctcactgcttaccagggcacaaggacaccagtggttcccaaaatgggtcacag 
caggatggcctgcatcagattcaccagggagggctataagaaggcagac 



wo 2004/056866 



PCT/FI2003/000973 



11/42 



Figure 4C 

C SEQ ID NOS: 8 and 9 (6B AY310329) 

gggctcag 

ggagggctctgtgcctccgttcagcagagctgcagctgctgcccagctctcaggaggcaa 
gctggactccctcactcagctgcaggagcaaggacagtgaggctcaaccccgcctgagcc 
atgccagccaacttcacagagggcagcttcgattccagtgggaccgggcagacgctggat 

MPANFTEGSFDSSGTGQTLD 
tcttccccagtggcttgcactgaaacagtgacttttactgaagtggtggaaggaaaggaa 

SSPVACTETVTPTEVVEGKB 

tggggttccttctactactcctttaag | actgagcaattgataactctgtgggtcctcttt 

WGSFYYSFKTEQLITLWVIiP 
gtttttaccattgttggaaactccgttgtgcttttttccacatggaggagaaagaagaag 

VPTIVGNSVVLFSTWRRKKK 

tcaagaatgaccttctttgtgactcagctggccatcacag | tataacaagcccacctgctt 

SRMTFFVTQLAITV* 

gagctgggctgcagtggccagggtaaacatccaaggcaccagtgaaaaatacagagaagg 

taaaaggagcaagagttctgaagatggaacctgggatgggggaaagtttcttcaatcttt 

cctaccaacaagaactccaatttttcactcctataaccgtagaagtagaggtaattagga 

tcatccagcaaatgcttagaggcaaatatccctggatgaggatgccacagcttattttca 

ttatatttcttcgattacagtgtggtaatgcatgttgtatggaactacatattctttcag 

aatgaaaggatttagaggtggcaagaatatcagcttgaaatttaaagttttttcataaac 
aataaacaaatgataattgaaaattc 



* 
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Figiire 4D 

D SEQ ID NOS: 10 and 11 (GB AY310330) 

gggctcag 

ggagggctctgtgcctccgttcagcagagctgcagctgctgcccagctctcaggaggcaa 
gctggactccctcactcagctgcaggagcaaggacagtgaggctcaaccccgcctgagcc 
atgccagccaacttcacagagggcagcttcgattccagtgggaccgggcagacgctggat 

MPANPTEGSPDSSGTGQTLD 
tcttccccagtggcttgcactgaaacagtgacttttactgaagtggtggaaggaaaggaa 
SSPVACTETVTFTEVVEGKB 

tggggttccttctactactcctttaag | actgagcaattgataactctgtgggtcctcttt 

WGSFYYSFKTEQLITLWVLP 

gtttttaccattgttggaaactccgttgtgctfcttttecacatggaggagaaagaagaag 
VFTIVGNSVVIiFSTWRRKKK 

tcaagaatgaccttctttgtgactcagctggccatcacagjgttgtgctgctctacgcctc 
SRMTFFVTQLAITGCAALRL 

tacctacgtcctggtgtccctcagcatagacagataccatgccatcgtctaccccatgaa 

YLRPGVPQHRQIPCHRLPHE 

gttccttcaaggag | aaaagcaagccagggtcctcattgtgatcgcctggagcctgtcttt 

VPSRRKASQGPHCDRLEPVP 

tctgttctccattcccaccctgatcatatttgggaagaggacactgtccaacggtgaagt 
SVLHSHPDHIWEEDTVQR* 

gcagtgctgggccctgtggcctgacgactcctactggaccccatacatgaccatcgtggc 

cttcctggtgtacttcatccctctgacaatcatcagcatcatgtatggcattgtgatccg 

aactatttggattaaaagcaaaacctacgaaacagtgatttccaactgctcagatgggaa 

actgtgcagcagctataaccgaggactcatctcaaaggcaaaaatcaaggctatcaagta 

tagcatcatcatcattcttgccttcatctgctgttggagtccatacttcctgtttgacat 

tttggacaatttcaacctccttccagacacccaggagcgtttctatgcctctgtgatcat 

tcagaacctgccagcattgaatagtgccatcaaccccctcatctactgtgtcttcagcag 

ctccatctctttcccctgcagggagcaaagatcacaggattccagaatgacgttccggga 

gagaactgagaggcatgagatgcagattctgtccaagccagaattcatctagaccctagg 

gcagtgccagtgctaggctgagcaccatcagctctcccaggtccttgtcacctgcttggg 

cacgtgcatggaacccgagccaacttcaccccaccctcgtcattacctgggagatgcaca 

agacaaatgttctaatgactgcatgcactgcttaagtattggccaacacgaactccccag 

ttattcatgccagccaggaaggaaacgccttccttccccaccattcccagccctccttcc 

cactggccagcacctgaacccagtgaacacaggcattagtggtccagggtcctggcttgg 
agccagtgagtagac 
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Figure 4E 

E SEQ ID NOS: 12 and 13 (GB AY310331) 

gggctcag 

ggagggctctgtgcctccgttcagcagagctgcagctgctgcccagctctcaggaggcaa 
gctggactccctcactcagctgcaggagcaaggacagtgaggctcaaccccgcctgagcc 
atgccagccaacttcacagagggcagcttcgattccagtgggaccgggcagacgctggat 
MPANFTEGSFDSSGTGQTLD 

tcttccccagtggcttgcactgaaacagtgacttttactgaagtggtggaaggaaaggaa 

SSPVACTETVTFTEVVEGKE 
tggggttccttctactactcctttaag | actgagcaattgataactctgtgggtcctcttt 

WGSFYYSFKTEQLITLWVLF 
gtttttaccattgttggaaactccgttgtgcttttttccacatggaggagaaagaagaag 
VFTIVGNSVVLFSTWRRKKK 

tcaagaatgaccttctttgtgactcagctggccatcacag|attctttcacaggactggtc 

SRMTFFVTQLAITDSPTGLV 
aacatcttgacagatattaattggcgattcactggagacttcacggcacctgacctggtt 

NILTDINWRFTGDFTAPDLV 
tgccgagtggtccgctatttgcag | aaaagcaagccagggtcctcattgtgatcgcctgga 

CRVVRYLQKSKPGSSL* 

gcctgtcttttctgttctccattcccaccctgatcatatttgggaagaggacactgtcca 

acggtgaagtgcagtgctgggccctgtggcctgacgactcctactggaccccatacatga 

ccatcgtggccttcctggtgtacttcatccctctgacaatcatcagcatcatgtatggca 

ttgtgatccgaactatttggattaaaagcaaaacctacgaaacagtgatttccaactgct 

cagatgggaaactgtgcagcagctataaccgaggactcatctcaaaggcaaaaatcaagg 

ctatcaagtatagcatcatcatcattcttgccttcatctgctgttggagtccatacttcc 

tgtttgacattttggacaatttcaacctccttccagacacccaggagcgtttctatgcct 

ctgtgatcattcagaacctgccagcattgaatagtgccatcaaccccctcatctactgtg 

tcttcagcagctccatctctttcccctgcagggagcaaagatcacaggattccagaatga 

cgttccgggagagaactgagaggcatgagatgcagattctgtccaagccagaattcatct 

agaccctagggcagtgccagtgctaggctgagcaccatcagctctcccaggtccttgtca 

cctgcttgggcacgtgcatggaacccgagccaacttcaccccaccctcgtcattacctgg 

gagatgcacaagacaaatgttctaatgactgcatgcactgcttaagtattggccaacacg 

aactccccagttattcatgccagccaggaaggaaacgccttccttccccaccattcccag 

ccctccttcccactggccagcacctgaacccagtgaacacaggcattagtggtccagggt 
cctggcttggagccagtgagtagac 



wo 2004/056866 



PCT/FI2003/000973 



14/42 



Figure 4F 

F SEQ ID NOS: 14 and 15 (GB AY310332) 

gggctcag 

ggagggctctgtgcctccgttcagcagagctgcagctgctgcccagctctcaggaggcaa 
gctggactccctcactcagctgcaggagcaaggacagtgaggctcaaccccgcctgagcc 
atgccagccaacttcacagagggcagcttcgattccagtgggaccgggcagacgctggat 
MPANFTEGSFDSSGTGQTLD 

tcttccccagtggcttgcactgaaacagtgacttttactgaagtggtggaaggaaaggaa 

SS PVACTETVTPTEVVEGKE 
tggggttccttctactactcctttaag | actgagcaattgataactctgtgggtcctcttt 

WGSPYYSPKTEQLITLWVLP 
gtttttaccattgttggaaactccgttgtgcttttttccacatggaggagaaagaagaag 

VFTIVGNSVVLFSTWRRKKK 
tcaagaatgaccttct t tgtgactcagctggccatcacag | aaaagcaagccagggtcctc 

SRMTPFVTQLAITEKQARVL 

attgtgatcgcctggagcctgtcttttctgttctccattcccaccctgatcatatttggg 

IVIAWSLSFLFSIPTLI IPG 
aagaggacactgtccaacggtgaagtgcagtgctgggccctgtggcctgacgactcctac 

KRTLSNGEVQCWALWPDDSY 
tggaccccatacatgaccatcgtggccttcctggtgtacttcatccctctgacaatcatc 

WTPYMTIVAFLVYPIPLTII 

ag I catcatgtatggcattgtgatccgaactatttggattaaaagcaaaacctacgaaaca 

S IMYGIVIRTIWIKSKTYET 
gtgatttccaactgctcag | atgggaaactgtgcagcagctataaccgaggactcatctca 

VI SNCSDGKLCSSYNRGLIS 
aaggcaaaaatcaaggctatcaagtatagcatcatcatcattcttg | ccttcatctgctgt 

KAKIKAIKYSIIIILAFICC 
tggagtccatacttcctgtttgacattttggacaatttcaacctccttccagacacccag 

WS PYPLPDILDNFNLLPDTQ 
gagcgtttctatgcctctgtgatcattcagaacctgccagcattgaatagtgccatcaac 

ERFYASVIIQNLPALNSAIN 
cccctcatctactgtgtcttcagcagctccatctctt tcccctgcag | ggagcaaagatca 

PLIYCVFSSSISFPCREQRS 
caggattccagaatgacgttccgggagagaactgagaggcatgagatgcagattctgtcc 

QDSRMTFRERTBRHBMQILS 

aagccagaattcatctagaccctagggcagtgccagtgctaggctgagcaccatcagctc 
K P E P I * 

tcccaggtccttgtcacctgcttgggcacgtgcatggaacccgagccaacttcaccccac 

cctcgtcattacctgggagatgcacaagacaaatgttctaatgactgcatgcactgctta 

agtattggccaacacgaactccccagttattcatgccagccaggaaggaaacgccttcct 

tccccaccattcccagccctccttcccactggccagcacctgaacccagtgaacacaggc 
attagtggtccagggtcctggcttggagccagtgagtagac 
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MPANFTEGSFDSSGTGQTLDSSPVACTETVTFTEWEGKEWGSFYYSFKT 

TMl CYTOIiOOPl TM2 

EQL ITLWLFVFTIVGNSVVLFSTWR RKKKSRM TFFVTQLAITDSFTGLV 

BXOLOOPl XM3 
NILTDINWRFTGDFTAPDLVCRWRYLQWLLYASTYVLVSLS IDRYHAI 



CYT0L00P2 TM4 EXOXiOOP2 

VYPMKFLQGEKQA RVLI VIAWSLSFLFS I PTLI IFG KRTLSNGEVQCWAL 

TM5 CYT0L00P3 

WPDDSYWTP YMTIVAFLVYFIPLTIISIMYGIVIR TIWIKSKTYETVISN 

TM6 

CSDGKLCSSYNRGLISKAKIKAIKYSIIIILAFICCWSPYFLFDILDNFN 



EX0L00P3 TM7 

llpdtqerfyasviiqnlpalnsainpliycvfsssisfpCreqrsqdsr 



MTFRERTERHEMQILSKPEFI * 



Figure 6A 
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^long 



MPANFTEGSPDSSGTGQTIiDSSPVACTETVTFTEWEGKEWGSFYYSFKT 

TMl CYTOLOOPl TtS2 

EQ LITLWVLPVPTIVGNSVVLPSTWR RKiaCSRM TFFVTQIJUITDS^ 

EXOIiOOPl *EM3 
NILTDINWRFTGDFTAPPLVC RVVRYLQVVIjLYASTYVLVSLSID RYHAI 

CYT0L00P2 TM4 EX0L00P2 

VYPMKFLQGEKQA RVLIVIAWSLSPLFSIPTLIIFGK RTLSNGEVOCWAL 

TM5 CYTOI.OOP3 
WPDDSYWTP YMTIVAFLVYFIPLTIISIMYGIVIR TIWIKSKTYETVISN 

TM6 

CSDGKLCSSYNRGLISKAKIKAI KYSIIIILAFICCWSPYFLFDIL DNFN 
EXOLOOP3 

LLPDTQERPYASVI IQNLPALNSAINPLIYCVFSSSISFPCRVIRLROLO 



EAAIiMLCPQRENWKGTWPGVPSWALPR* 



^short (33 bp deletion in exon 3) 



MPANFTEGSFDSSGTGQTLDSSPVACTETVTFTEWEGKEWGSF 
YYSFKTBQ LITLWVLFVFTIVGNSVVLFSTWR RKKKSRMTFFVT 
QLAITPINWRFTGDFTAPDLVCRW RYLQWIiLYASTYVLVSLS I 
DRYHAIVYPMKFIiQGEKQAR VLIVIAWSLSFLFSIPTLIIFGK RTLS 
NGEVQCWALWPDDS YWTPY MTI VAFLVYFI PLTI I S IM YGI VIRTI 

WIKSKTYETVISKICSDGKLCSSYNRGLISKAKIKAIKY SIIIIIiAFIC 
CWSPYFIjFD ILDNFNLLPDTOERPYASVI IQNIiPAIiN SAINPLIYC 
VFSSSISFPCRVIR LRQLQEAALMLCPQRENWKQTWPgVP 
SWALPR* 

Figure 6B 
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MPANFTEGSFDSSGTGQTLDSSPVACTETVTFTEWEGKEWGSFYYSFKT 
EQLI TLWVLFVFTIVGNSVVLFSTWR RKKKSRMTFFVTQLAITV* 



D 



MPANFTEGSFDS SGTGQTLDS S PVACTETVTFTE WEGKEWGSFYYSFKT 
EQ LITLPJVLFVFTIVGNSWLFSTWR RKKKSR MTFFVTQLAITGCAALRL 

YLRPGVPQHRQIPCHRLPHEVPSRRKASQGPHCDRLEPVFSVLHSHPDHI 
WEEDTVQR^ 



Figure 6C and 6D 
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E 



MPANFTEGSFDSSGTGQTLDSSPVACTETVTFTEWEGKEWGSFYYSFKTEQ LITLWVLF 

VFTIVGNSWLFSTW RRKKKSRM TFFVTQLAITDSFTGLVNILT DINWRFTGDFTAPDLVC 
RWRYLQKSKPGSSL* 



F 



MPANFTEGSFDSSGTGQTLDSSPVACTETVTFTEWEGKEWGSFYYSFKTEQL ITLWVLF 

VFTIVGNSWLFSTW RRKKKSRMTFFVTQLAITEKQA RVLIVIAWSLSFLFSIPTLIIFG 
KRTLSNGEVQCWALWPDDS YWTP YMT I VAPLVYFIPLTI I S IMYGIVI RTIWI KS KTYET 
VI SNCSDGKLCS S YNRGL I S KAKI KAI KYS 1 1 1 ILAFI CCWSPYFLFDIL DNFNLLPDTQ 

ERFYAS VIIQNLPALNSAINPLIYCVF SSSISFPCREORSODSRMTFRRRTIgPWRM^TT.g 
KPBPI* 



Figure 6E and 6F 
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Figure 9 
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Fig. lOA, Fig. lOB, Fig. IOC, Fig. lOD, Fig. lOE, Fig. lOF 
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Fig. llA,Fig. llB,Fig. lie, Fig. llD,Fig. HE, Fig. IIF 
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Fig. 12Aand 12B 
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Figure 1 7 
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Figure 20 

I exons 2, A, 6, 1, B, 9 6B AY312364 SEQ ID NOS: 16 and 17 

gacacagagaagaccactcccacctccccgagtgcaaggtgtgaagggacagatcttttaac 
catgcctgcccccttatacttgctgttcatagaattgcaactgaaagtgaccatgaggat 
ccactggatggagttacttctttcttaagtgaggaggctaagatctggagtgacttctcc 
ccagatttttgtatacctgactctgtttcagcatccgcttcccaaagaatgcagtgtgaa 

M Q C E 

gcaggagcttatgtgagaagaaacgcagggagacagttcagtcactgcaatcttcatgcc 

A6AYVRRNA6RQFSHCNLHA 
catcagtttcttgtgagaagaaaacaagagctccaatctgttagatggtattttgaagca 
HQFLVRRKQELQSVRWYPEA 

ggtctttgggtaaaggacacctagacccagtgaaggtcatggtgattattattggacaat 

Q L W V K D T 

gggacatcactctgctatttgaacaaataagactttttcctgacatgcatctggaggcag 
gtcaaagctccaggccaactccaagtttctgatggggtctctagccaatggaaggcttct 
tctccttcaattgcctgactcttcaggactcttaaatactgcaaagtgagaaaatgagac 
aggttgcactgagggctgttagccagacagagtctcgaactggaagtccatctagatgtt 
ttgcataagagaatggaaacaatctgtctgtgatttagggacatactctggcagcaatat 
gggaatacagtttcaatcctcattaacaaaacaggtatgaaatacatatttatttagtaa 
ggtgccagctgtatgaaaaatccatttcttatttcccataatgtttctgaaatgtcttag 
cagtgcatagagacagcatgtcatcattttctagggactgtgtgttattgcatttttcct 
agggaagatcttttctaggtcacctgctccttcgctaaagctctgaccaatctagcttgc 
taacctgtgactccattttcctaagtcctgagagagaaaaacgctttgcagcaaattatg 
ccaggcatccttgtgtctaaatgaaaaaaggaaaaagcctccttccttccctctgttgag 
aagtgcacggtccacatatgcatgcacagcatatactgtgagggtatttgcagtcccttg 
ggttgctttgataactggccaggttgctgttctatttttccacattctattaatcctcct 
acaggcagttattaggtattgagtgctcacacacccctggcatagtcaccacatgccatt 
agctccagataaacttccagaaaaaagtccatcccccacttctctcagctgcctgccaac 
gctggacaccttctcaccaagccagcaggacagagaaaagcctgggctttaagatcaaac 
aaacacagcttcaaattaggactctgtcacttcctgtgtactgggcactttgctgagtat 
gtggtttctcatctgtaaaacagagaaagatgattatctcccaatctttctatgttatat 
gtttgaattaaataaggtactctccatgaa 
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Figure 20 (continued) 



m exons 5, 6, 10a, 13, 15, 16 GB AY 312367 S£Q ID NOS: 22 and 23 

tctaggactcagaaatatagatgttagtaagagcaaacagacataacagataacacatac 
aaagtgcctaccacatgctaaccactgctgcaggcactttctatagaagaactaatttaa 
tcatcaccataaccctatggggtagatgatatttttacaacctccattttacagatgaag 

M K 

aaactgaagcatagacctgcttatgtgagaagaaacgcagggagacagttcagtcactgc 

KL'KHRPAYVRRNAGRQFSHC 
aatcttcatgcccatcagtttcttgtgagaagaaaacaagaaaacaaggactgaaatcca 

NLHAHQFLVRRKQENKD 
cacaggaaggtggcagtgaactccacagacggacctggacgcctcaacactcctggcctt 
acctcccttgctgaacgtctcaagtttctctgcgttcaggactggcaacgcctgcttcct 
cctctgagctgtcaagtaggaagtccgggctgctctgctagaaagagaagtcatgtgcag 
gagcactgaggcatcccaggtgtgacactcttccacctagagcattccgtctctcatcct 
ctgccatgtgacgctgggcttctttaacaaattaatcccaagtgcaagacatttatttct 
tctgtacctaatgacctgagcaatccttctctgctgaacctggtagtgtcatctttagaa 
gtgaagacacaattaacacatggtcatttcttcattatatcgttgttact 



IV exons 5, 6, 13, 17, 19 GB AY312368 SEQ ID NOS: 24 and 25 

tctaggactcagaaatatagatgttagtaagagcaaacagacataacagataacacatac 
aaagtgcctaccacatgctaaccactgctgcaggcactttctatagaagaactaatttaa 
tcatcaccataaccctatggggtagatgatatttttacaacctccattttacagatgaag 

M K 

aaactgaagcatagacctgcttatgtgagaagaaacgcagggagacagttcagtcactgc 
KLKHRPAYVRRNAGRQPSHC 

aatcttcatgcccatcagtttcttgtgagaagaaaacaagactggcaacgcctgcttcct 
NIiHAHQFLVRRKQDWQRLLP 

cctctgagctgtcaagtaggaagtccgggctgctctgctagaaagagaagtcatgtgcag 

PIiSCQVGSPGCSARKRSHVQ 

gagcactgaggcatcccaggtgtgacactcttccacctagagcattccgtctctcatcct 
B H 

ctgccatgtagcaaactgctatgcatccttcagctgcaagggattgaatgctatcaacaa 
ccatacaagtggagaagcagatgcttccctagctgagcctcaggctttttgatggaattg 
ctacaacttggtgcatgcctgctcctaaaagaaatactcaggaattgtctcataaagtcc 
tcacctactggcaaaaacaagatgttctactcccaggttgactttttcaagccccaagat 
gttgagtcagccattctccaaggatctcgatttccttttaatggaaaataacattaaaca 
ccaaatataagcctcgctgtcccacatgcgtattggggacaagatgaaacctgcttccag 
gctactttggcagcagaactgaaaaaggctttttttccagatatatgatttctcatcgac 
agggttgcacagccctctttattgttcgtgtaaatgacacccttggatctgaacaataca 
caccaggacaattgtgtgcaacagttctacaaactgatatttctaatta 
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Figure 20 (continued) 



IVB exons: 5, 6, 13, 17, 18, 19 SEQ ID NOS: 26 and 27 

gtctaggactcagaaatatagatgttagtaagagcaaacagacataacagataacacatac 
aaagtgcctaccacatgctaaccactgctgcaggcactttctatagaagaactaatttaa 
tcatcaccataaccctatggggtagatgatatttttacaacctccattttacagatgaag 

M K 

aaactgaagcatagacctgcttatgtgagaagaaacgcagggagacagttcagtcactgc 
KLKHRPAYVRRNAGRQPSHC 

aatcttcatgcccatcagtttcttgtgagaagaaaacaagactggcaacgcctgcttcct 
NLHAHQPLVRRKQDWQRLLP 

cctctgagctgtcaagtaggaagtccgggctgctctgctagaaagagaagtcatgtgcag 

PLSCQVGSPGCSARKRSHVQ 

gagcactgaggcatcccaggtgtgacactcttccacctagagcattccgtctctcatcct 
E H 

ctgccatgtagcaaactgctatgcatccttcagctgcaagggattgaatgctatcaacaa 
ccatacaagtggagaagcagatgcttccctagctgagcctcagctctgttttcctcagtc 
ctaggggcagattctaatttggtacagaaaaaaaaagtactcctgccaattcccatcctg 
gaatgactgaagaaaagaaaatgcatgtgattattttaaaggctttttgatggaattgct 
acaacttggtgcatgcctgctcctaaaagaaatactcaggaattgtctcataaagtcctc 
acctactggcaaaaacaagatgttctactcccaggttgactttttcaagccccaagatgt 
tgagtcagccattctccaaggatctcgatttccttttaatggaaaataacattaaacacc 
aaatataagcctcgctgtcccacatgcgtattggggacaagatgaaacctgcttccaggc 
tactttggcagcagaactgaaaaaggctttttttccagatatatgatttctcatcgacag 
ggttgcacagccctctttattgttcgtgtaaatgacacccttggatctgaacaatacaca 
ccaggacaattgtgtgcaacagttctacaaactgatatttctaatta 



V exons 5, 6, 17, 19 GB AY 312369 SEQ ID NOS: 28 and 29 

tctaggactcagaaatatagatgttagtaagagcaaacagacataacagataacacatac 
aaagtgcctaccacatgctaaccactgctgcaggcactttctatagaagaactaatttaa 
tcatcaccataaccctatggggtagatgatatttttacaacctccattttacagatgaag 

M K 

aaactgaagcatagacctgcttatgtgagaagaaacgcagggagacagttcagtcactgc 
KLKHRPAYVRRNAGRQPSHC 

aatcttcatgcccatcagtttcttgtgagaagaaaacaaagcaaactgctatgcatcctt 
NLHAHQPLVRRKQSKLLCIL 

cagctgcaagggattgaatgctatcaacaaccatacaagtggagaagcagatgcttccct 

QLQGIECYQQPYKWRSRCPP 

agctgagcctcaggctttttgatggaattgctacaacttggtgcatgcctgctcctaaaa 
S 

gaaatactcaggaattgtctcataaagtcctcacctactggcaaaaacaagatgttctac 

tcccaggttgactttttcaagccccaagatgttgagtcagccattctccaaggatctcga 

tttccttttaatggaaaataacattaaacaccaaatataagcctcgctgtcccacatgcg 

tattggggacaagatgaaacctgcttccaggctactttggcagcagaactgaaaaaggct 

ttttttccagatatatgatttctcatcgacagggttgcacagccctctttattgttcgtg 

taaatgacacccttggatctgaacaatacacaccaggacaattgtgtgcaacagttctac 
aaactgatatttctaatta 
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Figure 20 (continued) 



IB: exons 2, 3, 6, 7, 9 GB AY312365 SEQ ID NOS: 18 and 19 

gacacagagaagaccactcccacctccccgagtgcaaggtgtgaagggacagatctttta 
accatgcctgcccccttatacttgctgttcatagaattgcaactgaaagtgaccatgagg 
atccactggatggagttacttctttcttaagtgaggaggctaagatctgagttcttcaca 
tctctctgtagataaaatttccggtctggtttcacattcctctgtcagaagaactttctt 
taatgtttcttaaagtacaggtctgctgcttatgtgagaagaaacgcagggagacagttc 
CFLKYRSAAYVRRNAGRQF 

agtcactgcaatcttcatgcccatcagtttcttgtgagaagaaaacaagagctccaatct 

SHCNLHAHQFLVRRKQELQS 
gttagatggtattttgaagcaggtctttgggtaaaggacacctagacccagtgaaggtca 
VRWYFBAGLWVKDT 

tggtgattattattggacaatgggacatcactctgctattaagtgagaaaatgagacagg 

ttgcactgagggctgttagccagacagagtctcgaactggaagtccatctagatgttttg 

cataagagaatggaaacaatctgtctgtgatttagggacatactctggcagcaatatggg 

aatacagtttcaatcctcattaacaaaacaggtatgaaatacatatttatttagtaaggt 

gccagctgtatgaaaaatccatttcttatttcccataatgtttctgaaatgtcttagcag 

tgcatagagacagcatgtcatcattttctagggactgtgtgttattgcatttttcctagg 

gaagatcttttctaggtcacctgctccttcgctaaagctctgaccaatctagcttgctaa 

cctgtgactccattttcctaagtcctgagagagaaaaacgctttgcagcaaattatgcca 

ggcatccttgtgtctaaatgaaaaaaggaaaaagcctccttccttccctctgttgagaag 

tgcacggtccacatatgcatgcacagcatatactgtgagggtatttgcagtcccttgggt 

tgctttgataactggccaggttgctgttqtatttttccacattctattaatcctcctaca 

ggcagttattaggtattgagtgctcacacacccctggcatagtcaccacatgccattagc 

tccagataaacttccagaaaaaagtccatcccccacttctctcagctgcctgccaacgct 

ggacaccttctcaccaagccagcaggacagagaaaagcctgggctttaagatcaaacaaa 

cacagcttcaaattaggactctgtcacttcctgtgtactgggcactttgctgagtatgtg 

gtttctcatctgtaaaacagagaaagatgattatctcccaatctttctatgttatatgtt 
tgaattaaataaggtactctccatgaa 



n exons 1, 4, 6, 12 GB AY312366 SEQ ID NOS: 20 and 21 

gctgatggtggaaggagaatgagtctctgatgcctttggacttgatgctggaaagacttaa 
gactttgggggactactggaaaggagtgacttctccccagatttttgtatacctgactct 
gtttcagcatccgcttcccaaagaatgcagtgtgaagcaggagcttatgtgagaagaaac 

MQCEAGAYVRRN 

gcagggagacagttcagtcactgcaatcttcatgcccatcagtttcttgtgagaagaaaa 

AGRQPSHCNLHAHQPLVRRK 
caagtggatatacactgttccaagcagcatgtgttgaaaagatttgtcttttccccattt 

QVDIHCSKQHVLKRFVFSPF 
aatggtcttggtacctttctcaaaaattgaccatatatga 

NGLGTFLKN 
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Figure 20 (continued) 



VI exons 1, 6, 13, 17, 19 GB AY 312370 SEQ DO NOS: 30 and 31 

gctgatggtggaaggagaatgagtctctgatgcctttggacttgatgctggaaagacttaag 

MPLDLMLERLK 
actttgggggactactggaaagcttatgtgagaagaaacgcagggagacagttcagtcac 

TLGDYWKAYVRRNAGRQPSH 
tgcaatcttcatgcccatcagtttcttgtgagaagaaaacaagactggcaacgcctgctt 
CNLHAHQPIiVRRKQDWQRLL 

cctcctctgagctgtcaagtaggaagtccgggctgctctgctagaaagagaagtcatgtg 

PPLSCQVGSPGCSARKRSHV 

caggagcactgaggcatcccaggtgtgacactcttccacctagagcattccgtctctcat 
Q E H 

cctctgccatgtagcaaactgctatgcatccttcagctgcaagggattgaatgctatcaa 
caaccatacaagtggagaagcagatgcttccctagctgagcctcaggctttttgatggaa 
ttgctacaacttggtgcatgcctgctcctaaaagaaatactcaggaattgtctcataaag 
tcctcacctactggcaaaaacaagatgttctactcccaggttgactttttcaagccccaa 
gatgttgagtcagccattctccaaggatctcgatttccttttaatggaaaataacattaa 
acaccaaatataagcctcgctgtcccacatgcgtattggggacaagatgaaacctgcttc 
caggctactttggcagcagaactgaaaaaggctttttttccagatatatgatttctcatc 
gacagggttgcacagccctctttattgttcgtgtaaatgacacccttggatctgaacaat 
acacaccaggacaattgtgtgcaacagttctacaaactgatatttctaatta 



Vn exons 5,6,13,14 GB AY 312371 SEQ ID NOS: 32 and 33 

tctaggactcagaaatatagatgttagtaagagcaaacagacataacagataacacatac 
aaagtgcctaccacatgctaaccactgctgcaggcactttctatagaagaactaatttaa 
tcatcaccataaccctatggggtagatgatatttttacaacctccattttacagatgaag 

M K 

aaactgaagcatagacctgcttatgtgagaagaaacgcagggagacagttcagtcactgc 
KLKHRPAYVRRNAGRQFSHC 

aatcttcatgcccatcagtttcttgtgagaagaaaacaagactggcaacgcctgcttcct 
NLHAHQFLVRRKQDWQRLLP 

cctctgagctgtcaagtaggaagtccgggctgctctgctagaaagagaagtcatgtgcag 

PLSCQVGSPGCSARKRSHVQ 

gagcactgaggcatcccaggtgtgacactcttccacctagagcattccgtctctcatcct 
E H 

ctgccatgtgccatgttttgaaccactagattagagggtcaagcaatttcttggaatttt 

actctgaattctacgtagaccattttcatgtgtatacctcctctgagtcaccctcaggta 
gggacatttt 
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Figure 20 (continued) 



VIII exons 5,6,12 GB AY 312372 SEQ ID NOS: 34 and 35 

tctaggactcagaaatatagatgttagtaagagcaaacagacataacagataacacatac 
aaagtgcctaccacatgctaaccactgctgcaggcactttctatagaagaactaatttaa 
tcatcaccataaccctatggggtagatgatatttttacaacctccattttacagatgaag 

M K 

aaactgaagcatagacctgcttatgtgagaagaaacgcagggagacagttcagtcactgc 
KLKHRPAYVRRNAGRQFSHC 

aatcttcatgcccatcagtttcttgtgagaagaaaacaagtggatatacactgttccaag 

NLHAHQFLVRRKQVDIHCSK 
cagcatgtgttgaaaagatttgtcttttccccatttaatggtcttggtacctttctcaaa 
QHVLKRFVFSPPNGLGTFLK 

aattgaccatatatga 



DC exons 1, 4, 6, 11 GB AY 312373 SEQ ID NOS: 36 and 37 

gctgatggtggaaggagaatgagtctctgatgcctttggacttgatgctggaaagacttaa 
gactttgggggactactggaaaggagtgacttctccccagatttttgtatacctgactct 
gtttcagcatccgcttcccaaagaatgcagtgtgaagcaggagcttatgtgagaagaaac 

MQCEAGAYVRRN 
gcagggagacagttcagtcactgcaatcttcatgcccatcagtttcttgtgagaagaaaa 

AGRQFSHCNLHAHQFLVRRK 

caagtttaggaaaacttcctacaccttctttgttgggatgttctctggactaatgactcc 
Q V 

aggcgagaccaccgttgatcatgaactcactttgaaacagaagctgggttggtaagactg 
gagctact 



X exons 4, 6, 10b GB AY 321515 SEQ ID NOS: 38 and 39 

gagtgacttctccccagatttttgtatacctgactctgtttcagcatccgcttcccaaaga 
atgcagtgtgaagcaggagcttatgtgagaagaaacgcagggagacagttcagtcactgc 

MQCEAGAYVRRNAGRQFSHC 
aatcttcatgcccatcagtttcttgtgagaagaaaacaagaaaacaaggactgaaatcca 

NliHAHQFLVRRKQENKD 

cacaggaaggtggcagtgaactccacagacggacctggacgcctcaacactcctggcctt 
acctcccttgctgaacgtctcaagtttctctgcgttcaggtaatgtataggagggttatg 
agggcagagaattcctaagctcattagtaaattgctcttcagaaaagtgctttgaagcaa 
agctaatttcctttcccaatatgagaagatttggccctaccagaaaaaggaaatgatttg 
aatgtgcgccaaaaaatatgttttctttcttttctttgttgaacactcatcgggagttac 
tcttattagttccgcatttttattgccattt 
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Figure 20 (continued) 



XI exons 4, 6, 13, 17, 19 GB AY 321516 S£Q ID NOS: 40 and 41 

gagtgacttctccccagatttttgtatacctgactctgtttcagcatccgcttcccaaaga 
atgcagtgtgaagcaggagcttatgtgagaagaaacgcagggagacagttcagtcactgc 

MQCEAGAYVRRNAGRQFSHC 

aatcttcatgcccatcagtttcttgtgagaagaaaacaagactggcaacgcctgcttcct 
NLHAHQFLVRRKQDWQRLLP 

cctctgagctgtcaagtaggaagtccgggctgctctgctagaaagagaagtcatgtgcag 

PliSCQVQSPQCSARKRSHVQ 

gagcactgaggcatcccaggtgtgacactcttccacctagagcattccgtctctcatcct 
E H 

ctgccatgtagcaaactgctatgcatccttcagctgcaagggattgaatgctatcaacaa 
ccatacaagtggagaagcagatgcttccctagctgagcctcaggctttttgatggaattg 
ctacaacttggtgcatgcctgctcctaaaagaaatactcaggaattgtctcataaagtcc 
tcacctactggcaaaaacaagatgttctactcccaggttgactttttcaagccccaagat 
gttgagtcagccattctccaaggatctcgatttccttttaatggaaaataacattaaaca 
ccaaatataagcctcgctgtcccacatgcgtattggggacaagatgaaacctgcttccag 
gctactttggcagcagaactgaaaaaggctttttttccagatatatgatttctcatcgac 
agggttgcacagccctctttattgttcgtgtaaatgacacccttggatctgaacaataca 
caccaggacaattgtgtgcaacagttctacaaactgatatttctaatta 
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cattgaactg gaatataggt ccaaagaaaa 60 
acaaaaaata caaaaagaaa ataaaaagca 120 
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tatggaagta gtgaaaagga ctcccagaaa aagataaagc ccagactgga tagaaacaac 180 

acttatagtg gccaataatt ttcccaaact ctttaaagac atcaaaacac aaatacataa 240 

agcaccataa aggacaacag caagcaaaca aacaaatata aagagaatta aaacacatga 300 

aatgataaga ctaagtaaaa gccccataat cttgaatttg aaatgaaaat atttatgcta 360 

gctcatgatg tattttatgc ttcaggagtt tttaaggata cataccaaaa tgtttatagg 420 

taaactgata acatgtctgg gatttacttc aaaataattt gaggctgggg gtgggttgag 480 

atgtatagat gtacaaaatt gatattgata actgttggag gttggtggtg agtacattgg 540 

attctttatt ttttttatac tattctctct acttttatgt aagtttaaga tttttttcat 600 

aataaaattc aaaacaaaat tatgcatgat tataatgaaa caaaataaag agatctgggt 660 

agttaaagat gttaaagtcc ttggattgcc tggatgtagt aaaagtacta atttctatta 720 

gattttaatg tcaaggaggt gtgttaaaat ctctaaagta atcctcccaa attatgaaaa 780 

ctgacattac tagcaagcta aagaaggtat aaaatgcagt atttgaatat gcaaatatat 840 

atgtttatat atagagttat tccccagtat ccatggagga ttagttccag aatcccctgc 900 

caatacaaaa attcgaggat gcttaagtcc cttacataaa atagatagta tttgcatata 960 

acctacatac accctcctgt atactttaag tcatctctaa attacttata atttctaatg 1020 

caatgtaaat tctatataaa aatttttata ctatattctt aaaatttgta tcattttatt 1080 

attgtattgt tattttatac tttcttaata tttttaatct gcagttgata gaatccatgg 1140 

atgtggaacc cacagataaa aagggccaat tataatacat acatattata taaaatcaat 1200 

ctaagttaag acaggagaga aaaaatatta gaattattag aacaagtaag aagaataggc 1260 

cagtacactt aacctcaaat gtatcagtaa ttgcattaat tgtaaatgga ctaaatattc 1320 

ccattaaaat aacaagattg tcagagtaga tttttaaatt acctatttta atttgtttat 1380 

aaaaggtgta caccataaat aaaatgatgc aggttaaaat taaaaggaaa aggatggtaa 1440 

agtaaaatga taaaaaataa tgaaacaaaa caaaacacaa agtaaaagta agccaatgca 1500 

gctacatatg tgtatcaaaa gtttataaat ttgctgtgat tcttggaatc acttgtagct 1560 

ttatctgtgt cttagaattt gctttgaaag gaatacaaag acaggagggc ttctaaagct 1620 

agtaatgttc tatttcttga tcttgcttgt agttacgtgg gcgtgttcct cttgtaataa 1680 

ttcaccaaat tatatactta tgattttgta tacttttctt tatgtgtttt atttcaatac 1740 

aaagtttatt tacaaaatat atcacattaa tccagggcag gggttgcaat ctcaaatggc 1800 

tttaggaatc aagcaggcaa tgaatgtaaa catgtaaaag agccaagggg aagagacaga 1860 

atgaaaacag agataaaaac cagagaatga atgccctgcc taaagacatt aaaatcaatt 1920 
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tttgtcaaac actgtgctga tcaaacaaat cacacatgca ggccagattt gacccaaggg 1980 

tgactttctt acaacaactg atatataaaa tttaaacatc agaatttgaa aagtgtgcca 2040 

ctataaaatt ctagcccacc tctgaagtat gaacccacct tcaatttgtg aggtagaata 2100 

ttaatcttta ctggtggatt gtgagaagtg aggccaagaa gttggggagg ttaacattta 2160 

caatggttta gacaaatcat atgtttgaca aggtttaaaa tcccttctgt tcaaagaata 2220 

tcttatatta ggattttctt ctcctaatgt gtttattatt gtttttatta gtataatttt 2280 

taaagaaatc tgaactattg cacgcctcag agaccataac agtgagaaca aaaatataaa 2340 

ggaaaatttt caaatgatgg aaagtgtaac ccatataacc cgaaatcttc atgattttca 2400 

gatgaccatc tgattaagga gaaaagtcaa gataagcaca tattcatatc tgcttatttc 2460 

tagaattgtt ctatcagaag accatgaatt ggtcatatga ggggaagaaa cactgacatg 2520 

atactggctc tattaggatg atattaaaag aggctgctgg ctcaagctta gcatggatga 2580 

gttgttctca cataaagatg ctgttattga ccaggtgtgg tggctcacgc ctataatccc 2640 

agcactttgg gaggctttgg tgagtagatc ccctgtgtcc aggagttcaa gaccagcctg 2700 

agcaacatag tgagaccctg tctctactaa aaaatttttt aaattaaaag ttaaaaaatt 2760 

aaatgatgtt gctattgtta atgcattcct gcaatgctaa catgctttca taaatatact 2820 

tatgatttta taggtgtttt cctcttagcc tcaagaaggg agatgtttta gtctagggac 2880 

tgtcataaca aaataatata aactgtgtgg ctcaaacaac agaaatttat ttcctcgcag 2940 

tcctgtaggc tgaaagttta acattagggt gccagcatgg tgtggctctc ttcctgggtt 3000 

gcagatgtca ccgtctcact gcatcctcat gtggcctttg tgcaatgcct ctgcacaaag 3060 

agagagaaac agcaagttat gtgccgtctc ttcttgtagg ctcactaatc ctatggaatc 3120 

agggccccac ccttgttaac cttaattatt tccacagagg tcatatgttc aaatacagtc 3180 

acattgggag ttagagattc aacatataaa ttttagggga atacaaacct tcagtctgta 3240 

acagggaaca gaggtgctag aagaaatgta gtagtagcaa aaggtggggt ggggagataa 3300 

taggagcttc atatgtcttg ctaagaagtt caggctttgc ttttagcaca gcccctcttc 3360 

tgctagcttt gttcaggcca ccatcacctt tgatgcagat tactacaaaa gccacccaag 3420 

gtgtatgaaa tgagaaaaca catacttaaa ttaacccatg gccaggtttt gctggtaact 3480 

tgggaacaca gtaagggaag gaaaataact ttgttgtgga gagggcaaac caaggaaggt 3540 

cagaaagaca tagagacagt ggaagctcaa ggaaccggag gttgtcccaa agtcataagg 3600 

cggatgtaac gagagcacca gaacaacaga gctggaagga aggaaggaag gaaggatatg 3660 

tttagaaagc aagatgttcc cagatgaata ctgcagaggt agtataactc agtgcaacga 3720 

ttgtactggg taggcaaggt agggtggcag ggaaagtcac tagcaaacat ggccacaata 3780 
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atggaaacta aactttgtgt ttaccatgca ccaggctctg tgctacatgc tttctgtaga 3840 

tcatctcatt taatcctcac acagggattt aagcctgaag tttttttcag agaagacata 3900 

caaatcccaa gtaagagcat gaaaagatgt tcattagcat tagtcattag aggaaatgca 3960 

aatccaaaga tcagggagat acgacttcac accttcaaga atggctattc ccctataaaa 4020 

taatatacta aatcactctt tgaaacataa cattttccac aaactcacct ctttgaattc 4080 

acattttgct ttgccagtgt tttatgttat tcagacatga gtttgaatct cagcatttct 4140 

acttctgagc cctatgacct atgaaaagtg acaatttctc tgaatatcaa tttcccaatc 4200 

caaaaaatgg gaatacctct tggagttgtg gtgaggatta aataagataa tgaatataaa 4260 

gtgcctagta tagtaagagg catattattg ttcaataaat gctagctgtg aatgctactg 4320 

ctatagttat ttcattcact gtagaaatga ttaatttaaa ctccttcaag taacttacta 4380 

aagagtgaaa gttctccaaa aggcttgaat ttcactgcaa acctctttcc ttgttttgct 4440 

cttttattca ggacaataca ctttgctcag gtgtgtcttt gactctcaag cattgcttta 4500 

aagaatgacc agactttgtc atataccatt cttaaaggag ttcaggacat gccaccccaa 4560 

aatatgctgc tttcatatat tgattatttt gagctgatag cacttcaaca gcaaatgcaa 4620 

gagagacttt ttcggaactc agctgcctaa aaacagatcc tccaaaagaa attcaattgc 4680 

catatatcct ctccctggga gttttattaa tcagcaaaga tacactttta tcacaaaaca 4740 

ggaaactgga agtcaacacc acctacagaa aaactttttc ataaactgtt atatctccca 4800 

ttgattcttc taaaaaccca ttcatctttc cccagaatca tttactccac cctaagtgac 4860 

ttacatcacc cctccccatc ccctgttaac acagtatgta agctctcaaa tctcttttag 4920 

gagtattcat tattttcctg tggcacccca tgaacataat attaaaatga atatgccttc 4980 

tctcttatta atctgcctgt tgtcagttta tttcatagac acagctatcg aacctaggag 5040 

agtacaggga aattctttcc tcccctatat gcttaaatca taagattcaa ctgacatcag 5100 

caatgaaaca aacattcaga aaactaaatt cagagacatt atatctggct tttctgaggg 5160 

agaataataa gaactaatag gcattgtcag gtaaggcaat acagatgttt ttaattaaat 5220 

gatgtttttt caccatgacc cacatcttga aaacaccttc ctaattaaag accagtagtc 5280 

ttgagtgatt ctttgaagaa catttttact tccatttggt cttttcttta ttgagttgat 5340 

gaatctgctg gaattagatt cagcgcacat aacataaaat ccaaataatg ctggcttaag 5400 

ttcataagga tttattttta tcttatgtaa agaagtccgg acgtaggtaa tccagggctc 5460 

gtgctgcatt ttcttaccat tgggaaccca ggctcctctg tatttctact gtattaaagg 5520 

actgtcaccc tcaagttcag ctcatggtac agatggcggt tggttcaaga tgactgctag 5580 
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aacatctgtc atcatgccca aattctagcc agcaggaagg aagaagggag gaaggacaga 5640 

ggggctcata tcagctcttt gtcctcctct taattggcat tgccagaatt cccacccatt 5700 

agtttttcct tgcatctcaa tggtaacgtc taagtgaaaa aaagcctggg aaatattttc 5760 

atttattatt atttttttaa cccaggccca ttggccagaa taaaaccagc ttccttttcc 5820 

taaggaagaa ggagaaatag aattaaggaa cagtctctgg catagcttgt tctccatact 5880 

tttacttcca tccagaactg agaagtttag acaacaaatc tctttttgtc cacccattgc 5940 

ctgctttgag aatgcttctt ccagagtgct acgctctctt ctaatgactt attatcttgg 6000 

cagactgcac ttttcacacc ctccataggc tgcagtgatc cattggatga ttgtcccatt 6060 

atgtagataa gttcctattg ctggctgttt atgcttttct cccattttta ggctattagc 6120 

aatcctgcaa taaacattct cataattaaa tttgtatgag aattctgtat tatttattta 6180 

gaaaaagttt atagaagtgg acttactgca tcaaagaaag gcacttttgt ggcttttgaa 6240 

ttggctaatg tcttttttgg ttacccaagt taaaagatga tggaaaatgt cgggggacaa 6300 

atgctctaga acccgctagc atagacctac actaagtctt attcaaaaag gtccctgtaa 6360 

acaaaaccct agttgacttg cagtagctac agaatcataa ctggtttttt gagagcagag 6420 

attctctccg ttcctccctt agtgatactt agccagctac tggtaatttg cttattatca 6480 

aaaggagcgg ccaggcgcgg tggctcatgc ctgtaatccc accactttgg gaggccaagg 6540 

cgggcggatc acgaggtcag gagatcgaga ccatcctggc taacacggtg aaaccccgtc 6600 

tctactaaaa atacaaaaaa tcagccaggc gtggtggcgg gcgcgatggc aggcgcctgt 6660 

agtcccagct actccagagg ctgaggcggg agaatggcgt gaacccggga agcggagctc 6720 

gcagtgagcc gagatcgcgc cactgccctc cagcctgggt gacagagcaa gactccgtct 6780 

caaaataaat aaataaataa ataaataaat aaataaagca tttttttttt cactttagcc 6840 

agcgcatcag agaagaaccc acacttgatt gtcttatttt tccccctaca atgcccagag 6900 

cagtattcac acattcatga aaagaaaaca catcacttag aacattcgaa tagagcacgt 6960 

gtttttattt tgctcttcta tgttttacct ccacttctcc acaattttga tcccttcaaa 7020 

aaaaaatctt aactaattat tctctatata ttctattagg aatcttggct gtagacttaa 7080 

tcttgaggtt agaaagaaaa tagatcttga agccactatt ttggcacatt ctgtaattct 7140 

tagaatttcc attctaacaa ttttctggat gaatttaata accgtttata ttttatgagc 7200 

taaagtgcca attaaaatta ttcatttacc atctttaaga cttacttgtc tgttttattc 7260 

aaatgcaata aaaatgcgga actaataaga gtaactcccg atgagtgttc aacaaagaaa 7320 

agaaagaaaa catatttttt ggcgcacatt caaatcattt cctttttctg gtagggccaa 7380 

ate ttc teat attgggaaag gaaattagct ttgcttcaaa gcactttctg aagagcaatt 7440 
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tactaatgag cttaggaatt ctctgccctc ataaccctcc tatacattac ctgaacgcag 7500 
agaaacttga gacgttcagc aagggaggta aggccaggag tgttgaggcg tccaggtccg 7560 
tctgtggagt tcactgccac cttcctgtgt ggatttcagt ccttgttttc ctggattttg 7620 
agttgcattc tcaatacatt atttttattt ttattcatag tgaaaatatt gaagactgaa 7680 
ttttttctga attcatacaa tttatatgac atgttatttt ttattttcta aatattttac 7740 
tgttgaagta gttcctcttt cacccaaaaa aactaatagt ttttgctgtc tttttttaat 7800 
ttctacgtga ccatttcttt aatattttgt aattcacatt taattttatt gccttgtagt 7860 
caaggaatat ttctgaaagg acttctgctc tttggaactt attaagagtg tctttgaggt 7920 

ttaatatgta attattatct taaaaatatt taatgtatac taaaatgtaa gacacaattt 7980 

attgctgtgg aattaaacaa agaccatcca aatgagaaca agcaatggct atttactcag 8040 

agtttgggag tcagcccaca gtttcttgtg ttgacagaga ctcaaaaggc tgagtggaaa 8100 

agctttgcaa gggggaaaaa aaggcagggt ctcagctatg ccctgattgg aggctgctgg 8160 

cctagggaag ttgacctgaa gcacagcatc ttatcggatt gcttaagggt gcatatttac 8220 

ctttctccta ttggtcctaa gttagaagca agggaaacaa ttaaaggagc tatcagttat 8280 

gaaccaagtt ctgggtttgt tgggctaatt gctatggtgg tattgtttgg cttcctgagc 8340 

tggttgctgc aggttgtgga taagagttct attttatgta tatggtctgg ccattgtcta 8400 

tatgttcagt ctctcaatgc attatttgta cttattcttt tgaatttgcc tttactaatt 8460 

atggtctcta tattgtctct ttttatatac atattttttg cctgtttaac gtatgataga 8520 

tgactaaaag tgctattaaa aactttcaca cttttggttg gacgtggtgg ctcatgcttg 8580 

taatcccagc actttgggag gctgaggcgg gcagatcact ggaggtcagg agttcaagat 8640 

cagcctggtc aatatggtga aaccccgtct ctattaaaaa tataaaaatc agttgggcat 8700 

gatggtgcac gcccgtaata ccagctactc aggaggctga ggtgggagga tcgcttgaat 8760 

ccaggaggtg gaggttgcag tgagcaggga tcacgccact gcactcctgc actccagcct 8820 

gggcagcaga atgagactcc atatcaaaaa aaaaaaaaaa aaagagagaa aagaaaagcc 8880 

tttcacactt tctctcatct tttagtattt ttgatcaatg tttgttttat acatttcaat 8940 

ggtttattat ttggttccta aaggctcctg attagcatat ctttaatgta tgtttgtgta 9000 

tgtatgttga tctctctctc tctctctctc tctctcatac acacacataa gtcttttcat 9060 

ttattagctt tgctttaaac tttgctaatg tgatattaat attatgttta tctatcttct 9120 

ttttattgca cttgtttgcc aagcaccgaa ctaagcattt taaataatat cacatttttg 9180 

taactgtttt ggcatttatc ttcttcattg tgactttact tgaaataatt atcaatgtgt 9240 
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atttctatat ttgagtgttt caatacttat caagagtttt cttccccatt cttggttcct 9300 

gtgtttcgac acagctctca actttctgta gtactttggg tagtattttt tttttcaaga 9360 

agtcattgga taagattata aacacctggc atttctttat gagccttttt cttcactcca 9420 

caacaataat gatttagatg ggtatagaaa tcttgaattc ataactcaaa tctccataga 9480 

tgttgctcca tttaattctg gtattttgtg ttaccaaaaa aaaatctaaa gtcagtccaa 9540 

gatttctttc ttgataggtt afcccaatttt gtcttttgtt tctgattttt ttttttttag 9600 

atttcatgct ttcaccaggt tatgtctatt ttttggtcgt ctgttatgtg agttctctga 9660 

gctgccatat gctcttttga tctgctgatt aatgtgaatt cgtaggttat tttctttatg 9720 

tctttgaata cagattactt accatttatt ttaatcttat cttcaggaac tctaactgtc 9780 

tgtctatatg ttagtaattt tggcctttct gtcatcattt aaatatcttg gtcctattct 9840 

tctacgtggg aagaataatt tatactttat taatcaataa tttgattttg attcaatgat 9900 

tagcagtgct aattctactt tttattgttt ctgaagtatc ttaaaggttg ctttgtctat 9960 

gaaaagtttt aaaacattct tcctttgtat tatctggtgc ctttttattt ctgtcagctt 10020 

cttggtgttt cccttctcac aacctatcta atgtcttatt atctcgtctt tcacctttaa 10080 

ttccatactg tcaaagtgtc cccaaaatgt aaaaagaatg caaaatatgc gctaaaattt 10140 

cagggtttct tgagctaaag tttttttgaa tatatattct ttttctgata tttttgtgct 10200 

ctttttaact caatgctagg aaatctctct ctgtctcata tgtttcaccc atctactgaa 10260 

agttttttaa ttggctaaca tctgaataaa tgcacatatc ttttatttgg attattccta 10320 

cactagctgc acaatgccat cattgtttcc tccaccaaga agagctgggt gggccatgtt 10380 

cattgagaaa atcagtttct ctcaaacagg agaaaaaatg gcattgtcct atcgctatgt 10440 

gaataattca actaaaaaga ataactgttc tcccagtaaa gctttttctt tcttagaaaa 10500 

attgggtttc tttatattcc atgcctttgc ttcagagcac aatatgtgac ctgggaaagg 10560 

catttaccct tgttaaacct tcattttctc atttctaaaa ttacaagggt tatctccatg 10620 

atctttagga agccagaact gatctggttc taatggtcct ttgattctaa ttagaaattt 10680 

ccaactgcaa aagttgaggg ggattcttac atcttcagaa gccaactaat taaacaactt 10740 

ttgctgtaat gagtgtcagt cacctttatg aatcagcttg tcacttaata atagaaatag 10800 

ctggcttctg gcccatgcag aaggaatgta gaaaatagac tatagtaaca agctgactcc 10860 

cattctcctg ttgttccctg cctcactcag tttccacagt caacttagga caattccata 10920 

ctgtgattat ttactcctgc tttgctttcg taattcaaca aacagtatta accacttact 10980 

atggcgaggc atggttccag gtgctaagga cccacagtgg atttgattac aaattacagg 11040 

taaaataata aagcaataaa aaggataatt ttgctttcat catcgaaagc tcagtgatag 11100 
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agtaagcttt agactttatt tgcctcagca atgcagcagt ggtagagggc agaggagaag 11160 

tctcatccta agccttgctc ccaaggctgg ttataggatg ctgtattagt ttgctgggac 11220 

tgccgtaaca aataccaaag attgggcagc ttaaacaaca gaaatttact ttctcacagt 11280 

tctagaagct agaagtcctc tatcaaagtg ttggctaatt ttattttgag gcctttctca 11340 

ttggattgta gatggctgtc ttctccgttt ttttcacatg gctttttttc tgtccatgtc 11400 

tgtatcctaa tcttctcttc ttatgaggac accaccagtt atactggatt agggcccacc 11460 

cagaattacc tcatcttttt ttcttcatta tctctttaaa gtccctatct ctaagcatgt 11520 

cacattgtga aggactggag gttaggactt taacatatga atttagggga gatgcaattc 11580 

agcccataac aaatgcctgc tagtaactca tgggaactct cttccacatg caaagagata 11640 

aaatgtgatt tctgcaagtt ttattagaaa tataggaagg aacccttccc agcagggctt 11700 

gacaagcctg tgcatgtgcc tcactgacat aactaagtta tagggtcata tttgaactaa 11760 

tccatggaac aagacatgca attacactga tgagcttagg ctgaagtcaa atgtctcact 11820 

cctgaaaatt cagacttccc caaaacacat gggctgaaca ggtaagatgg atacttcaac 11880 

tgaaagccag ggtacttaac aaaagcagga gaaatagatg ctggggaggt aatcccaacg 11940 

tccagtgcaa atgctatgag ggaaaacgac agagtcccaa acatgctgga gcttatagtt 12000 

agaggaagga aaaattagat ttttctcagg gctttgggat tggggcacct ggcagtcagc 12060 

gcctagattc caaagcaggt cagaagatgc ctctgcccaa gtattggaaa ccttttgaat 12120 

atctttgagt attaaaaaca ttttaacact gcctgagaaa agtagtagtt atgaagttat 12180 

acatttagcc tatcccattc aactcagggg attgtacaca gatcaaataa attgcaattg 12240 

tgtttccctc cctgatattc acataggaag gtgagtggtg agcagtaaac ttcaggacct 12300 

ttaaagttat cattgctttc acccttgcac cagctttttg aggtaaatat tcccattttc 12360 

taggagagga taatgaggct cagagagggt aagtggcctc cccagggtct tctagctgaa 12420 

ttgtaaccag cttgatcttc cttttgaatt ccaccatctt gcctgtgagg gaaaaaggaa 12480 

taaggagcca ggagttgcag ggtcattctt ctcctcaaga ggcttgagag taactaagta 12540 

tctgttcaaa acacttttga gaatttggcg gcctcattac gtttctcagg agtccctggg 12600 

ggaggttaca cttttaggat ccccaggaat agagtgacaa aatacagcta ccacaacttg 12660 

gtcctgtgca actgtgccct tgctcagcaa ttactgactc cattctgact gaactctgcc 12720 

aaaaactctt aacattaata attattacct gttctcaatt ttaataatta agagttttac 12780 

aacttccagt gactcttaat tcaggtttct ttggcttact gttcttggac aataacagta 12840 

tggtcactgt tgcacagaca gaagctagac atagtctgta tagctaactc cctcatgtcc 12900 
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ttcaaatctt tcttcaaatg tcagtttctt ttggggcctt ctgtgaccaa cctgatgaat 12960 

agagcccacc agccccagac cacagcgttc ctgacttcct ttactctgct tcacttttcc 13020 

ctttttcaca gcaattgtca cattctattc tactataact tact tat tta ttgaatttac 13080 

cactgattgt ttgcctaacc tcactataag acagggatct ttgttttatt ctccaagaac 13140 

ttagaacagt agttattcaa ttaatatttg ttgaagcaat gaatgaacca aatgaatgaa 13200 

tgaaagatga atataaaaat atttatatgt cctaacactc tactaagcat tatgacagaa 13260 

tatccaccaa aaaaaaaaga tgagaaggag aaaccattat tcttctgtag cagggctaac 13320 

agaacatgag cagatggcaa acgtgagatt atcagcacct tctgggctag ttcctggaaa 13380 

tcagctgatt tcctgctgct tccaaaaatt cttaaagctt tcagactcct gagcaagctc 13440 

ctgcaaaaca cctattttgg tgctgcaagc acctgcagta gctctctagt cattcaggta 13500 

cttgttgacc ttaggaactc aagccatagc ttccacgatt ttcaatccct gatccctctt 13560 

cagtctcaaa ttgctgtatc caaaaccctt cctacttgca atatattaga aaggccacct 13620 

tctgaccaaa tctgaaaagt tgggtaggtt tgtagaggat gagtttttgg attattccca 13680 

attcttttga tcctcacccc aacctcctaa gactaaagct acagtggcct ccctttaggg 13740 

aagaaagcag aggaaaggag cagaggaatt attttctgtg aattaactgt ttctagattc 13800 

taatattagt gactaaactg ccatttgtac ttcctatatt cctgttttag caagagctaa 13860 

aagagtttca aagaaggtaa attgtattct aagaagagaa ttcagaaagt cctttccaga 13920 

atagatgaat cttaaatgat gggcaggttt cccacagaga taggtgggga agccattcta 13980 

acctagaaaa caggatgcct agaagtatgc agtgttttca gaaaatggca gacacaggct 14040 

tccttggcta gagaacaaag acttcataca ggagtaggtt cataaataca agcacaagag 14100 

acctttatag cattggggtc tagtgattct caattattct gcctataagc atcaactgca 14160 

agaatagtta aaattctatt gaagagaaga aatgtttcct gctgtgccag aaacccttaa 14220 

gaaaaagcaa aggaatttca cagagctgaa gatcaagcac ctgagaaaca agtttgcctc 14280 

aagagatgct tccaaaggca agttggaagc ttattgatga aaaagtgaaa cactatcaca 14340 

aggagtacag gcagatggga cagaactgaa attcaaatgg ctaggatggc aagaaaagct 14400 

gaccacttct atgtacctgc agaacgcaaa ttggcatttg tcatcaggat cagagatatc 14460 

aatgatgtga gcccaaaggt ccaaaaggtg ttgtagcttc ttcgcctttg tcagatcttc 14520 

aatggaagct ttgttaagct caacaagctt agcatgctga ggattgtaga accatatatt 14580 

gcacgagggt acccaaacct gaagtcatta aatgaactaa tctacaagca ttgttatggc 14640 

aagatcaata agaagagaac tgccatgata gataccaatt tgattgcttg atctggcttc 14700 

atctgcgtgg aggatctgat tcatgaaatt tgtactgctg gaaacacttc aaagaagcaa 14760 
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ataacttcct gtaccccttc aaattatcct ctctatgaga tggaatgaag agaaaaacca 14820 
ctcattttat agaggtggag atgttggcaa cagggaagac cagattaatg ggctcattag 14880 
aaggataaaa taaggtgtct gccatgatta tttttgtaat cttgtcagtt aataaacagt 14940 
gactgctttc gatttgaaaa aaaaatacat agttaaaata cacctacaca ttcacaccaa 15000 
agattctgat ttaattggta tgaagccaaa ccaagacatt agtacttttc cagagcttcc 15060 
caggtgattt taatttgcag ccagtgctga gaaacactga gacgatgcac ataaaatgcc 15120 
tagcccagag cctgctcaag aaatgttagc tgttgctctc tactgtcact gtttgggtgg 15180 
ttgagacttg tgcccaggct tcctagggga gtgttcctcc cattgcccca cagttccctg 15240 
taagtggtgg aaaaggagac cctatagcag cacttctcaa caccggctac acatttgaat 15300 
caatagagga gccttgccaa ggtttgcaaa agatggctgg gtctgtctct aaccaattag 15360 
aatctctgaa ggtgggaccc aggtatcccc attatttact ttccccatta actattccac 15420 
aaatatcctc atatgatctg atctctattt ccccataatg attgactggt atttctcttt 15480 
gccatttcag tattcacaat ctgctttgta ttaaagctgc ttatttacgt tatctctccc 15540 
actgtgaacc actgcagatc agggactgga tttgttttgt tttgttttag tttttataca 15600 
attgctttcc ccttgcactc agcatacagc aaagctctaa cacatgatat atacttacat 15660 
aatattaatt taattaattg tcagaaacca tagatactca actataagac tttcctgcct 15720 
tctccactct acttttccat ttttcatgtt ttctttgttg ttgtgctttt ctatgccttc 15780 
ttcacactgt agccaaagtt attttattca aaattcaaat taatttaatt atggaagctg 15840 
ggtaatgagc ctaataatat aaacacagtg gtgcagtgag catgcagtta gttacaatgt 15900 
tccatctatt tttctgttgt gtgaaggcat ccacaattta taaagtacct ggtgattcaa 15960 
aaattaagaa ccactattct aagagtaggt ggaggccaga ctatggaatc ttccaacaac 16020 
caggagcagg agttggaatg gaataaatta gagcactgga tcatccctga aatccctgag 16080 
cagaggaaca accgaaacag agatgtattt aagcttataa gtctaaataa aagttggggg 16140 
tttaaaaagg ctacttaatt acttctgaga aaaaatattt tattcatttc tcactgggct 16200 
tctataatca cctcagctct gtctgcagtt tcctttcttc tgtaattatt tttctctcca 16260 
ttttcttgcc ctgatagcca cgtgctcatt tgggtatcca tcaaaagact acatttcagt 16320 
atattaattc tctcactgga tggtacagct tgaatttcct tctgctagtt tcattaaaat 16380 
ctaaatgggg ggaaaagtcc tctacattat ctaggagatc catgtgagct gcccagctcc 16440 
attagctgtt gtctgttaga ggtggattac aagcaggtgt aatcttaagg acaccgccct 16500 
tctgtgattg ttaagaagga acacagactg agcagagcgg ctactgttct gatcaggagc 16560 
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caccaccact ggttcctaca gtatacagaa tttcaattgt cagatctgtc ctggaggaac 16620 

aatagaatac aactgtgaat taaataattt agaaaccaag gacataatct cacttttcta 16680 

aatgagtttg cctcttttag gagtcagtat attcgtagtt ttgctagcac agtcctgtga 16740 

tgtaggagga aaatgcccaa aaatattcta gaaaaaacat tgagaattca agttaaaatg 16800 

tcagatggag cagcctctag aattcctcct ttgtaggcca tcagtaatct cactgaaaat 16860 

tagacatgta ttatttataa ataataatat aatattatgc atattttaaa tattaatgat 16920 

ttaggtcaaa cgcagaaggt gaaatttgta ggtgtcagaa atgaaggcca aatctttagt 16980 

gggaaggctg tgggagctga agctctgcag agcacagtgg ggatccagat gcaatattta 17040 

gaggctggtg ttttcactct cctgtgcaga agtcaatgct ccaactctag agcatggaac 17100 

taggcttccc aaataagcct attgctggca aagctctgca tagtccgtga ctgggagatg 17160 

ggaaatctgg aaaactaacc atctgaccta ggcccatccc aggcctgtcc cggatccctc 17220 

cagaaatctc aagaaacagg atctcaaagc cacagggcac tatgtgcgag gcgccacgtc 17280 

cactggacac attcagggaa catcattcac atgaacatgt gtccttgttc tggttattga 17340 

ttgctgtgta acaaaccatc ccaaatatta gtggcttaaa acatcagttt attgcctttc 17400 

atatttctga gggtcaagaa tttaagtgga atacagtgag gtggctcatc ttttctccat 17460 

gctgtctgag actcagccag aaatggcttg actggcactg ttggcactgg tgccagctgt 17520 

ccgctggaag tctggcaaag atcgccagga aagttcctca gttcctcctc acttgggcct 17580 

ctccagaact ccctttctgg tgctacctgg gttccaagag gaccaggcga aagctgcaag 17640 

gactcctatg acccaggttc tgaaattatg aaatgtccat tctgccgcat tccattggtt 17700 

gaggatgagt cccagctcca gccagggttc agggagacgg gaccacacaa gggtgcgaat 17760 

aatatgaaac ttggctcatt gggggccatc tttggagact agtgaccaca cgtctggaat 17820 

tgtgcaatgg gccagtactt ggttgtggtt atagaaatcc cagtgtaaag cctaaataga 17880 

aacatgagtt aagaataagt caatccaatg gtgcacttca gaagtgaccg taaaactctc 17940 

ctgtagaagg actttcacat attagagcat ctacaaagct ccctcagttt ccaaagatga 18000 

tagatctcct ggaagataaa cttgaggaga aggtaggtgc ccactatcag agctggactg 18060 

tgagctccaa gctgggctca tttacctagt gggggtctac aggttactca tcctagaccc 18120 

agtcaattgc ccctgacctg cctcaggagt gaagcagagc cagggtcagg gctaggtggg 18180 

taggtgggct gccccagact taagctgtgc attgagaagt gccatagata ccaggaatca 18240 

aatgaatgga gatttgctat tatttggaag gatgtaagac cacaagcata tctataagaa 18300 

attgtaccag gatcacaaat ctcccatctc agcccagcag cagtgatctc tggaaagaaa 18360 

caccacccca catcccaggc tggtatactt cacagtaatg atgtgtgccc agattaacac 18420 
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aaatgacaaa ttccatcata ctccctctat ctgaaggagc acacctgatg tgagaaagag 18480 
catgagagga gcctggctgt cagagcagga attctgcact gtggcaggtt ctgctctgtc 18540 
tggctgatgc ttctgaatac agcagagata tgacggcctc ataatttagg ttgcttgaaa 18600 
ggcggagcaa gaaaagttac ctagtttaga aaaatcagga aactcatata tggaaaggct 18660 
atgctatccc aagccagcat tgctctggaa ctaagcagat gagaatgtga gcccagaaag 18720 
tttttctccc aaagaaatgc tctttttgac agccaatcag gcccaagtgg tttataaggt 18780 
agcccaagcg gcaaaacatt tgagtcagaa aaccctcacc ttggcacctc taccaaggaa 18840 
ttgtcagtca gccctacaga accttctgtc tggccttatc ttaaacttgt ggcagaagta 18900 
atgccacatg ttcacaatat cccatggatg gggtcatgtg atgtgttcag gccaatgaaa 18960 
tgtgagttga agggactatg tcactcccag actgagactg tgaaaagccc ctattagatg 19020 
gttgtccctc tgctttgagc tgtgagtttg gaggccacgt gttgagatgg tagagtcctc 19080 
agatcaaagc agccctgaat acaaaatagc tgtatttgga gcttgtgtgc tactgcagat 19140 
accataagcc tcgcctattt tgttactcca aagatgtgct gggttcatct ttgtagctag 19200 
cccaggggca ccatgatgct caggctacca tatgggaggg cgatccatga ggagcagcac 19260 
ctgggtggtg aatagcacct gggaagctga acctctgtgt tcagcttgga ggtagttagc 19320 
agatggagtt ctaggatgta ctaccatgag aagaaagtcg agggaagata aaagtcagaa 19380 
tgagggagat ggtattggtt aaattcaaag aaggaaggat aaaaaccttg gggaaagagt 19440 
ggaaagaggg tggcctccag agctgagtcc accaggatgt acaacaaatt atgcttgaga 19500 
tgagctgtcc aggactcagt atgcaaatgg gtgatctatg cctcagtgcc cacttattgg 19560 
aaaacacacc tgagatgaaa agcatgctct aaactctgtc actggagagt cttgccgctt 19620 
ggaccgtgag atgactataa caaaacactg ccattcatct tccaaaagag gaaatggaca 19680 
gacatataat tactagccca taaaatgatg gcaaataaca cagccctcca gacactatac 19740 
ttggtgccat ggaaaaattt gaacattatt gggcaatcac tgccttctga cttcaacttt 19800 
gacttaagaa gagcagaaga gtgtacatat tgggagcagg atgaagacac agagtttctg 19860 
gttcaaatcc tcgttctcct gctttctatt gggagatctt gcaagttaat tgcttgactt 19920 
ctttctgctt caattatttc atctgtcaaa tagagataaa agcaattcct ttttcatgag 19980 
gttagtatga aaattacagt ggatcatgaa gaaaagttct taggaccatg tctgatatgt 20040 
gagtcctcaa tgaatgatga attcattgaa tattatgata tcaccaggtt tttttaaacc 20100 
gctggccagg tatcccctca caatccttta tctctcctcc tctgtaaatg tccaacacag 20160 
gccacttctc ccctttgtac acctgctgtg caattcctgc cactttggaa tgggagcatc 20220 
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acagcttaag ggtcctaatt cctttctcag cctctgggta tcccaagtct ctaacttgcc 20280 
attgcctgct ttgcaccaag accagagtgt caggggcaga ttacaggagt cctacagtca 20340 
ctgctctgct aatgagtgct tcccttagag gtgaagaagc gaggcagtgc ctgctttgga 20400 
atgctttaat aaacaccaga aagtcctcac tgccagggat attttactca tttactttct 20460 
aaagcctaaa attctagaag ataggacaca ttcatttgct aaaatacaaa atttccaggg 20520 
agaggatgta taatgagtct tttttgtttg attgtttttt gagacagagt ttcgctcttg 20580 
ttgcccaggc tgcagtgcaa tagcgcgaac tcagctcacc gcaacctcca cttcccgggt 20640 
tcaagcgatt ctcctgcctc agcctcccga gtagcttcga ttacaggcat gtgccaccac 20700 
ccccaagtag ttttgtattt ttaatagaga cagggtttct ccatgttggt catgctggtc 20760 
tcaaactccc gatctcaggt gatctgccgc ctcagcctcc caaagtgctg ggattacagg 20820 
cgtgagccac tgcgcccggc cacataacaa gtcttatcag tcagtatacc tcagtataaa 20880 
ttaatttatc aggctccttt tctaaagggt tagatgaggg ggaaatgtaa gatggtggaa 20940 
gacttgggca tttgagaagc aattgagagg gaaaatgatc caaatgagaa ccctgggacc 21000 
cccagagaca gtcactaagg agttaccagt tggcatagaa cttctacttc tgctcccaac 21060 
tgtccaagct caaaatgaag cccagtaggg tcttcagggt ctcatcttgg aaaaggacag 21120 
cttccaagcc taggatttca gaatgtataa tggtcatttg agtgccgctc atcagtgaac 21180 
aatagctaat cagtgtgcaa attcagcagg cctgagctcc atcaccaaag gcattttcat 21240 
gaggacacat cctgcttgga gaattttgag tgccatcaga caagtcaaat cgctgccctt 21300 
ctctttactc aaaggatctg gaatcccctg aaccaggaaa aatgccactg tgacccttct 21360 
gggtgcataa gcacaggaat aaacacaata gatgggcaaa tccaactgta ccttatcctg 21420 
atgacatatt ttagggcagc agtttttaac tttctagttt acactcttaa aaataatatt 21480 
gagaacccaa aagaattttt gaggatgcag attaagttat ctctacagat atttatcatt 21540 
ttaaaagtta aagtggaaaa ttctttaaaa atattattta taaattcatt gtaaataaca 21600 
ataataaaat gttaacataa atgtaatacg ttatgctaaa taactcatat tttccaaaat 21660 
gacaaaaatg cagcttgcat tgttcacatt tttgcaaatc tcttcgatgc ttgtttccat 21720 
tcagctgtga tatcactcat gatgccacct ctagaaaact ccactctaca cacaagagag 21780 
aatgagaatg aaaaagacaa aggacagctc aggattgctg taaaaatagt tttgactttg 21840 
cagatccccc gaaggggtcc cagagaccca caggaggccc tgaccctact caaagacctc 21900 
ttttttagag gattgtttta ggtgattggg ctgttctgat atgatccatt cctaggcttc 21960 
taaacacctc aggcaagtct ccagggccca ataggaaggt caatacagcc tcttccttgt 22020 
taagggtaga cctacattga agatcaagga atcttggtgt ttgtagaata attatggact 22080 
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agcaaaggac aaagcagttg caggtttctt tgctgagcct ctgtgtttgt gagtctggaa 22140 

cccaaagggg cagcagctga ggggcaggga ctgagaagta gcaggatctg tctcttaata 22200 

gctgcaccaa aagggaaagt catgacattt ttctgagact tcatttgtaa aatggaaata 22260 

aagtccaagc acttcatgga gagtacctta tttaattcaa acatataaca tagaaagatt 22320 

gggagataat catctttctc tgttttacag atgagaaacc acatactcag caaagtgccc 22380 

agtacacagg aagtgacaga gtcctaattt gaagctgtgt ttgtttgatc ttaaagccca 22440 

ggcttttctc tgtcctgctg gcttggtgag aaggtgtcca gcgttggcag gcagctgaga 22500 

gaagtggggg atggactttt ttctggaagt ttatctggag ctaatggcat gtggtgacta 22560 

tgccaggggt gtgtgagcac tcaataccta ataactgcct gtaggaggat taatagaatg 22620 

tggaaaaata gaacagcaac ctggccagtt atcaaagcaa cccaagggac tgcaaatacc 22680 

ctcacagtat atgctgtgca tgcatatgtg gaccgtgcac ttctcaacag agggaaggaa 22740 

ggaggctttt tccttttttc atttagacac aaggatgcct ggcataattt gctgcaaagc 22800 

gtttttctct ctcaggactt aggaaaatgg agtcacaggt tagcaagcta gattggtcag 22860 

agctttagtg aaggagcagg tgacctagaa aagatcttcc ctaggaaaaa tgcaataaca 22920 

cacagtccct agaaaatgat gacatgctgt ctctatgcac tgctaagaca tttcagaaac 22980 

attatgggaa ataagaaatg gatttttcat acagctggca ccttactaaa taaatatgta 23040 

tttcatacct gttttgttaa tgaggkttga aactgtattc ccatattgct gccagagtat 23100 

gtccctaaat cacagacaga ttgtttccat tctcttatgc aaaacatcta gatggacttc 23160 

cagttcgaga ctctgtctgg ctaacagccc tcagtgcaac ctgtctcatt ttctcacttc 23220 

tagaagatgg gaaaaatgaa aactcattat tatggaaaac caccagtgac agacagtcta 23280 

atgtcagaat cagagaggaa ctgacactaa ttgtaaggca gatggagctg cattaggaaa 23340 

caccaagcca aggagccctc catgcccatg ccacacttag tttcaaaaag aaaaaaaaaa 23400 

aaggagccaa ggccaggcat ggtggcgcac gcctgtaatc ccagcacttt gggaggccga 23460 

ggcgggtgga tcatgaggtc aggagtttga gaccagcctg accaacatgg tgaagcccta 23520 

tctctactaa aaatacaaaa attagccagg tgtagtggtg cacgcctgta gtcccaggta 23580 

ctgaagaggc tgagggagga gaatcgcttg aacccgggag gtggatgttg cagtgagcca 23640 

agattgcaca actacactcc cgcctgggca acagagcaag attccgagaa aaaaaaaaaa 23700 

agggagccag acaaaggcaa ggcagaagca tggagccctg tgtgtcagga tctggcccac 23760 

cagaccaggg gttgctgagc tgtagatgct actaggctgt gaggcaagca gaatgtggac 23820 

tctataccag tctccacttt ctggaagaga aaacaagtcc catggggaaa tatgggcagg 23880 
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atgtaggggc agacataaga tatgatgcaa tgtgtagttc cccaaataaa gccctgaata 23940 
tgtgagttgg gaaaagttct ctgagtaaca gtaccatgtt gagagttgtg tgtgatgtgt 24000 
ggggtttggt gtgtgtgatg ggggatgggg agtgagtgtg catgtactta actgcagtat 24060 
ttaagagtcc tgaagagtca ggcaattgaa ggagaagaag ccttccattg gctagagacc 24120 
ccatcagaaa cttggagttg gcctggagct ttgacctgcc tccagatgca tgtcaggaaa 24180 
aagtcttatt tgttcactgt acaatgaaca gaaagcaggg gcaagagtaa tttattcaca 24240 
tatctgcttc taatcatcaa gtcggaaaag aaatctggat actatttttg agaaaaaaat 24300 
agagaaacca taaataattt ggaactgtat aagtacactt tataccaaga aaaaggaaca 24360 
acatctggaa ggctcaaaag aaaaggatat ctctgaaaaa aaaattatgt atatgtaaat 24420 
accttagtga ctagaaaaat atatttcaag gaaatgtttt actgtcatgc aaagttgtag 24480 
gaagacataa ataggaacag tggcataagc aaaaaggcag ggagggtaaa ggaataggat 24540 
ggtaagaaag aacaagaatg atcaaggagc aatgcttgag gcaataatga tcaaaacagg 24600 
tgtaattcat gttgcataac acccaatcca ttcctggaaa gacaaatttg agatgcattt 24660 
ccagaacgtc aaaggaaagg ataaggaaat gaaaacaaca aaataggaat ggaagatagc 24720 
aacgagtgtt cctggaagag aaaccggaat aaatggaaaa gaagcagaat caaacctaca 24780 
atagaagaaa cccttcctga gcttaataaa agacggagga ggagtcttgt gtgaacaaat 24840 
tcccagggct caccattttc aagcacagta attagaacga gaccccacct ccctcccacc 24900 
ttcagcatca catcttgcca aagcattctt attcattctc tcaactaaca cttatttcat 24960 
tccaactgca tgccaggccc tgcaattaga aaggttaaag aaaaaggccc caaagtttca 25020 
ggtagcacaa aattatgtat ttataatatg cacagtgact ttagatttct ttgtaaaaac 25080 
aaatatatca aaacatcatg gaaaaacatc tacagtttta agacaatagt ttgtgactca 25140 
ggaatttcat atgccgccga atttatgtat gaaagcaaca gtttttaaaa agatgacatt 25200 
aacattaaat gcttaaaaat agttaaaggt gttccagaac caatggagag ttggaacaga 25260 
attaaagtcc taagactaaa gaaataatgg tatagaaatt ctgaaattat taaaatatag 25320 
ataagcataa ttggtacaat gtagctataa tgaaatgata ttgcaaatat tataccagga 25380 
aaagaatata tataaatgta taaatcctta actcaaccat tccttttatt ttattttata 25440 
gagaaaagga gggggaaaag gaggggaagg agaaggaaga gaaggagaaa gaaaaagatg 25500 
gaacggtttc attagtaaaa aaaaattccc aatgaattat ttaaggatgg gtataatagt 25560 
ttctaaatga gggtgtaatt tgcaatattc attacaaagt gaaaaatgtt aaagtattta 25620 
agaataaaac aatgatggag ttgcttatat ggacaaaaca tacctataat gaaatacaat 25680 
agaataatgt ggataaaatc catccgtagg gtaataaact gtaatatgtg gtatatgtaa 25740 
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aaaaaccaag ttataatcta atacttagta caattctact ttggaaaaat atgtctatat 25800 

gtgtataaat agaaaataag gtctagaaat ctctgcagct aaatgttaac agttgttata 25860 

tctggaggaa aagattttgc atgcttatta tgttctgttt tatattgatt tgtttggggc 25920 

tcatctggac tttttagttt ttctggaagt tacatgtaaa ggatttaaac aaggctatgg 25980 

tatatttaga tttgtaaggg tagagacaga attgcagggg ccagtatgaa agagaaggta 26040 

attgaagcaa accaagtgag aggtgatggt ggtctaaatc cagacaatgg caagaagatg 26100 

gacaaaagtg ggcagacttg agctattcta agatgaggaa ctgatggtgt tggagatcca 26160 

ttagatatgg gagtgtggct gcagataatg acagttccgt atgtcagcat caacatggag 26220 

ggaagtgtct tgagccacta gaagatcagt ctgggaggag ttggcaaggc ctcttagaag 26280 

aaatcatgcc tcctttgagc tttaaaagat aatgaagagt cagccagctg gagaggtgag 26340 

gaaggccatt ttagcctgaa gtgagtgtcc cattatttga cctgtgtcac agtggccttt 26400 

cccagccaaa ttgcaaacag agtcaccaaa ttcgaataaa catttttaaa gggagaagaa 26460 

agcactgctc cctctggaaa ctgtgaacaa attaagccag agcaaatccg tccatcctgg 26520 

ctgcagcaat cccaagataa tgctgctcag ttctaggctg cttggcagaa aggggttatt 26580 

ctctgttatg tgcctcattt actcatgaat ggagaacttc tgggaatcca ggatcagcca 26640 

ggcacagcac agttactatt tctgggaaga tttcatttga ccccaattac ttacaatagc 26700 

agagtgatgt cccattgtcc aataataatc accatgacct tcactgggtc taggtgtcct 26760 

ttacccaaag acctgcttca aaataccatc taacagattg gagctcctgt caaaacaaga 26820 

agagtcatca cactgacagg atctccctga ggacaccaag tgcctaagaa tttgaacaaa 26880 

gcatcctttc tgtgcaggag gaagggggca caagatttct aaatagcata aataaataaa 26940 

taaataaata aataaataaa taaataaaga tgaattgtaa ttatctatgt ttagcagaca 27000 

tttgaagtag aaataggtca ctgggatttc catttcaaat ctctattgaa attctttacc 27060 

aaaagaattc ttttgcactt gatctatcca gccttttttt gagaggggga tctcactatg 27120 

tttccaaggc tagacttgaa ctcctgtgct caagcaatcc cttcatctca gcctccggag 27180 

cagggctttg ggtgtgcacc cccactgcac cccgctcagc actttttatc agctataatt 27240 

tataacaata atagtgataa taattataat aaaataatat gaatgtttgt taccatctgt 27300 

caggctctgt tatgaatata tcactgtctt tataacaatc acatgaagaa gctaactcaa 27360 

tttctacaga ggtcagagag aaaaattatc catttggttg aaccgaaacc catctatctt 27420 

ccacactgag ttagtctttt attttaaata aatatgttgt tatcaagaaa ggaaatcatc 27480 

cattatactt tcactttcac ctttgctttc aaaatcaccc aaacctgaat tactagctca 27540 
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tttctccaaa 


agcataatta 


atattgactt 


tctttttatt 


atctcctata 


actagatatt 


27600 


gaaaaacact 


atagctagca 


taaaggaaaa 


cttctgaaca 


gtgtagccat 


attgccacag 


27660 


aaacagattc 


agaaaaataa 


acctaagaac 


tctggagaaa 


gaatgcatgt 


tatataagtt 


27720 


tctccctaga 


gtttggcagc 


ctttgttcat 


agttggatgt 


aaacttattc 


atttctaccc 


27780 


ataagcccac 


atagctttac 


ttcctgacat 


ctgttctctc 


ctcccaggat 


cggaaaggtt 


27840 


tcttccttct 


gataaagtgt 


gaaacatttg 


tgaggaaaag 


gtgggttcag 


agagggagag 


27900 


aaagacgaca 


gcctgattcc 


tagatctgag 


gtgggacgca 


ggaagatgga 


tttcttttaa 


27960 


ttcccccttc 


aaagttattc 


ctcctcatgt 


gcttttggga 


cbatgctttg 


agaatctctg 


28020 


gagtggagta 


tctggtggtc 


tgcacatagt 


ggtgaagagc 


gtggagtcac 


agaatttggg 


28080 


gtaaaatccc 


agctctatgt 


ctaatgagct 


gggtgaactt 


gacaggtcaa 


ttcactggct 


28140 


aagcctcacc 


ttacctgttt 


ttaaaatgga 


gagtggtgat 


ataagaaaag 


tgcttagcat 


28200 


agtatctagg 


ttacagtagg 


tgaccaataa 


atgggagtga 


taactaccat 


cat tat at ta 


28260 


catggaaagt 


acaaaacaaa 


tagtatcata 


gtatgtagca 


aagtcctcat 


tttgttggaa 


28320 


aagagtgatt 


aaataaaacc 


gcaaatgtct 


gtgcctgtgg 


ggtgagatga 


gggagatgtt 


28380 


caacggtaat 


agacagcaaa 


gtgttaccaa 


gctacGcaaa 


tggatgatcc 


atttaatttc 


28440 


tacatggaaa 


aggagcaaaa 


aaggaaaggg 


acctctgaaa 


cccttctaag 


tacccacaca 


28500 


cacacagact 


cacacacaca 


cacacacaca 


cacacagaca 


cacacacaca 


ctcacacaca 


28560 


cacagagctc 


tcagaacact 


gctgccagag 


ggtgaatgca 


gatacaactt 


tcataaccat 


28620 


tgctgaaatc 


acatatattc 


caagaaggga 


aaatacttgt 


aaaactgaaa 


cttattaatt 


28680 


aagatactat 


agccaaaatg 


gatgtcacaa 


tgcttgtctg 


aatacccata 


ataacattta 


28740 


tgtaaaattt 


actatgtccc 


atcaactgtc 


ccaaatcctc 


taccgggata 


agatcacgct 


28800 


atcttgacag 


ccactctagt 


tggtaagtgt 


tggttttgag 


acatctacaa 


ttgaggatac 


28860 


tgaaggacag 


agacactagg 


taatctaccc 


aggctgcaca 


cctcatagac 


agttgaggca 


28920 


agactcatat 


ataagttaac 


gcatcccaga 


gcctgtgctc 


ccaaacacag 


ccctcataca 


28980 


gaactgctcc 


tgtagttaag 


aacatagtga 


ggtccccaga 


atgtgggcaa 


aatctagtgg 


29040 


agagactctt 


ttccgcccca 


taccagctgt 


tttcaccaga 


ttgttagtaa 


aaagaaagtg 


29100 


gatcaagaga 


atgagaagac 


aagccacagg 


ctgggagaaa 


atattggcaa 


aagacagaca 


29160 


tcggataaag 


gactattgat 


atggtttggc 


tgtgtcccca 


ccaaaatctc 


accttgaact 


29220 


gtaataatct 


ccacgtgtca 


agggtggtgc 


cgggtggaga 


taattaaatc 


atgggggcag 


29280 


tttcccccat 


actgtttctg 


tggtagtgaa 


taggtctcac 


gagatctgat 


ggttttataa 


29340 


atgggagttc 


ccctgcacaa 


gctctcttgc 


ctgccgccat 


gtaagatatg 


actttgctcc 


29400 
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tccttcacct tccactatga ttgagagacc tctaaatcat atgtggagct gtgagtcagt 29460 
taaacctctt tcctttataa attacccagg ctcaggtacg tctttattag cagcatgaga 29520 
acagactaat acaactatgg tacaaaatat aacagaattc ctaagactca acaataagat 29580 
aacaaatagc ccaattaaaa atgggtaaaa gacttgaaca gacacctcac caataaagat 29640 
acacagatgg caaatgaaca tatgaaaaga tgctccacat catatgtcat cagggaatgc 29700 
aaattaaagc aacaatgaca tagcactaca catcttttag aatggccacc atccagaaca 29760 
ctgacaacac caaatgctgg tggggctgtg gaataacagg atttctcatt cactgctggt 29820 
gggaatgcga aataggactg tcactttgga agacagtttg acagtttctc acaaaattaa 29880 
atatactctt accatataat tcatcaatca cacttcttgg tatttatcca aatgaattgg 29940 
caactcatgc ccacacaaga atctgcacat gaatgtttat agtagttata ttcataacag 30000 
ccaaaacttg gaagcaacca aaatgtcctc agtagatgaa tggaaaaaca aactatagta 30060 
catccagaca atggaatatt attcagcact aaaatgagct atcaaaccat gaagatatga 30120 
aagaaaatta catgcatatt actgattgaa agaagtcaat ctgcaaagac tacatattat 30180 
gattcccaag tatatgacat tctggaaaaa aacaaaacta tgaagacagt aaaaaaaaaa 30240 
aaaaaaatca gcagctttca gggattaggg agggggaggg attaatcagc agagcatgga 30300 
ggatttttag ggcagtgaaa ctatttcata tgttattata agggtgtata tgtgtcattg 30360 
tacatttgtc taaacccata ggatgtacaa caccaagagt gaaccctaat gtagacaatg 30420 
gactttgggt gatgatgatg tgtcaataca ggttcatcaa tagtaagaaa tgtaccattc 304B0 
tggtatggga tattgagagc agggagtggg ggttgtgcat gtgtgggagc agagagtata 30540 
tagaaactct gcacttactc gattttgcca tgaacttaaa actgctctga aaaataaggt 30600 
gtttttttta atgatgcata tttgccaatt atattttaat aaagctggga gatcaaaagg 30660 
agtggaggct atcttcaaac acatgtgcag ccttttaaaa ttcatttata aacaggatct 30720 
ccaggagggt gatcatgttt atcttgttca ctttttaaaa agtcattttg aaaaagcaat 30780 
aaattcaaag gaaattgcaa aaataaagag gcctgatgta ttattgacca agtttccctc 30840 
aatggttttt gcataacatt taacgtaact ctagtttgat atcaaaatca ggacaatgat 30900 
gtatgtacaa tgtgtgtata gctggacacc actttgctac ctggataagt tcatgtagca 30960 
accaccacaa tatcttgttc acttttgtca tccggtgctg atcatgattc tcaacaatac 31020 
tcttaagtga ctcagtcaac aaacaaataa gcaagtgaat gaacagatga ctgaatccat 31080 
gacatcagta aagagcatgt ggttcttctg atttgcatat gcactcagtc aatctcacag 31140 
atcattttta tttgtataaa gccttgagct gagagcagtc tattaaatac aaggattggg 31200 



wo 2004/056866 PCT/FI2003/000973 

19/111 

agcacctcac cacacattaa agtgcccttt cttccacttc aacttgacta attgagccta 31260 

tgtgagtgta actgccattc tcccaaagat cgttattgaa ataatataac ataaaagtgg 31320 

aggaaatgat tttttaaaaa tccctgatga taggaaagag tgctatccat atggttaaaa 31380 

aaaaaacctg aggctttctt gccagattca gaacaggtga gatggaaaac caccatctgg 31440 

gaagaaaaac aaaaaactgc ctttattttt ttttttgagg tgtaagtttt ttttattatt 31500 

attatacttt aagttttagg gtacatgtgc acaatgtgca gtttagttac atatgtatac 31560 

atgtgacatg ctggtgtgct gcacccatta actcgtcatt tagagatggg* aagagatacc 31620 

catcaaaact tcattctcaa cccccaaacc aagagctgga aataaaaggt gtacaaggtg 31680 

ccagttacag aggaaatgca ccagaccctg tcgctggaac agggtatcag aacatcctcc 31740 

caatctattc tgggccaagg aggcatgctg gaggcgggct gggctgagct gggtacatga 31800 

tgtggggtgc acgctggcag agccagttcc cacctgagcc agtgatgcta actagagagg 31860 

cgtgcagtgc ctcatccccc gacaaagctc agaagctcca ggtagagcag acgctcttgc 31920 

cttaatcttt aaaaataact ggactttgat ctggccaaat ggaactggag aaaattctcc 31980 

ccctcctttt gtctgttcac ctgaggtttt tcaggaacca tccaagcaat taaacaggag 32040 

tgacattatc tctgaagaaa ataatttcag ttaccagttt ggtattatga aagacccttg 32100 

ggaataaccc ctctgatggg cttcagaccc atgatagcta tacgtacttg ttttcttctc 32160 

acaagaaact gatgggcatg aagattgcag tgactgaact gtctccctgc gtttcttctc 32220 

acataagctg taaatgggaa caaagagaga agattaaatg tctgtaaatt ttcattgcag 32280 

cccctttttg tttagaaagc atttctgctt accttttata tttctattaa ataatagaga 32340 

gaaatttact cctccagctc ccaagagaag aatgaaagtt aaattagaat ttatagtgga 32400 

cccacaattt ctgacctaag aacattaata caggaatcca aacccctccc agctcttctt 32460 

tgctatagtc atcacccact cagaggttca gctctgaaca aaccccttct gagcctctcc 32520 

caggaagcct tatcaaccca cagtccagaa ccaccttcac cgtttgagaa cctcacaaag 32580 

ggtacttgca gtgctagtga tgcctacaga ctgcttggcc ttgtacttga gatggagtga 32640 

agacgaggga aaccccttga ttgttctcag ttttcccctt tgtgccttga aatttccatc 32700 

tcattgtgaa taaaaaagtc aaagcatatt caatcccaga gaggggttga atccctgaat 32760 

ttcacaagat tgaaagatca tgaaaaaatt gaccccctct ggaaagatga gaaaaatcaa 32820 

aaagatcaat gaacacatac aaagagtctg tatctatggc tcttattaac agtgactaaa 32880 

gaatttatct gagtaattta gctaattcaa ctcaataagg aaacaacata ataaaagata 32940 

taactgaaaa aatgaaatta tcaatttgga aatgatatga ttaagaataa aaaacaagat 33000 

tgtttttgaa aaatcattgg gcctaataag acatgtaatt aaagaagatg attataaaac 33060 
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caacaaatat aaaaagcaac tagcttctct gtacactatc aataaccaat tagaaaatac 33120 
aatcttaaaa aacaaaagat taaatattca agaacaatca aaatacaaaa tgcatatgaa 33180 
caaacttttt tgtttgtttt ggggacagag tcttgctctg tcacccaggc tggagtgcaa 33240 
tggcacgatc tcaactcact gcaacctccg cctccccagt tcaagtgatt cttctgcctc 33300 
agactcccaa gtaactggga ctacaggcac ccacccacca cacctggctc aatgtttatt 33360 
ttttagtagc aatggggttt caccatgttg gccaggctag tctggaactc ttgacctcag 33420 
gtgatctgag tgtcatggcc tcccaaagtg ctgggattac agacatgagc caccgtgcct 33480 
ggtcatgaat aaattttaaa agaattgtga tgacttacat gaagaaaatc gcacaagttt 33540 
actgagtgac attttaaaac tttaataatt gaaaatccat tcaatgttcc tggtaaggga 33600 
acagaattaa tggtcagaaa caaatcaaat atatataaga acctaggatt tttttaagtg 33660 
catttgaaat gaatagagaa aaaagggatc atgtctccag cacattgatg ttcctgcttt 33720 
tctgaacaca cttaactctg tgttgcctga agacctctgt acttgctgtt ccctctggcc 33780 
tgaacagtct tccctcagag cctgtttctg atctctgctc aatgttcttt taaaggatag 33840 
cagtgtcttg gggtctttat aactttgggg cttatctatg gctagcaaat gtaggacata 33900 
ggtgtgtaga gtatgcaaag caggaaggct ctcaatggtt aaatatctgt ttatctggtt 33960 
atgtttaaaa caattggatg tgtaaaaatt tgagcgtggt gctagtgggc atttaagcta 34020 
acagatctca gcctactatg aagaagtcat aaacccagga gtaatataca gaagccttct 34080 
ggctcatgtg catgacacac cccacctaat tctcaagctc ctccctcttc attttttatt 34140 
atagcaacta tcactctgtg acattataat ctatatctat ttggttgtgt gtctgttatc 34200 
tgtttttctc attagaatgt taactccatg gaggcaggaa ccatgtcatc cttgttgact 34260 
gcattttcag taccaaaaaa aagtgagtgc ctggacagaa tggaaaccfca ataaacattc 34320 
accaaaggaa gtaggagagg agaggggcag gagagaatat agtaaaatag gttaaagttt 34380 
ccaatatagc catgaaatta tttttaggaa aagaatttca ataactcgca tagaaaaaga 34440 
ttttttaagc atgataccaa ggcagaaatc cttacaaaag actaacagat tttactgcaa 34500 
aatataaaat ttacaaaatt taaattgtaa agattttttt aatacaaata gaattataaa 34560 
cggatttaaa tgtttagaat ataaataata tttacatagc tgatcaagag ttaatatact 34620 
cactaaaaaa ggactttcta caaatcaata agtagaagat aaatacttaa tacaagaata 34680 
aatacaaata taggaacaag taagatacac acaaaatgca attcaccagt aaacctgttt 34740 
taaatgttta attttgccaa aaactaaata aatgaaaata aaaataacaa tgagttacca 34800 
aattttccaa taagaaaaga aagtaggttt taagaaattg gtagtgcatt aggccatttt 34860 
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tattaaataa acctaaaatg gtatcaatga aatatttaaa agtacccctg ggttacattt 34920 
tgctttttgg tactttctaa ttattccacc aatatgtatt acttatgtaa ttcaaagaag 34980 
tgattttaca agaaaaaata aatagtatgg taaatcgctg gctcctagag agtgggccaa 35040 
tcttggaaaa caatgcccca gtgctgagaa ctgtgtaggt aagttgoatc tttoagtagc 35100 
acaaaggaga gattggtttt tcactaottt agatgagggc caggagctga ggggttaata 35X60 
gtgcacagat aaatcttgaa ggacaaatga agttttccac atggccaaag gaaggagaaa 35220 
ttataggcag atgaatagaa cgtgtgaaaa gagaggaaca caagacagca tgggccaatg 35280 
tagcatcata ataaaaataa gcaacaoaag gccaggtgoa gcggctcatg cctgtaatcc 35340 
cagcaccttg ggaggccaag gcgggcagat cacctaaggt caggagttca agaccagcct 35400 
ggccaacatg gcgaaactct atctctacca aaaaaaatta gccaggctcc acggcattcc 35460 
agcctgggtg aoagagtgag acttcatctc aaaaaaaaaa aaaaaaaaaa aaaatagcaa 35520 
cacttggaat tctcagtttg tagaattcac aactgatact aaaagtctat catgtaacag 35580 
ggctccggtt gaagttcacc tcactggtta tgagttaaag acagggtttt tgatgagact 35640 
gacacagaaa atgtcagcaa gaaaagaaag gacacatttc tctgatctcg cactctgggg 35700 
aatgctggaa caggtcatga gtgtaccttc tgcaagaggg gcccagctcc ctttctcact 35760 
catcaagaag ttgctcaact ttctaatata tcagtagata aagcaagttt accagtgagg 35820 
ggaagctagt ggtgggggca ggaactggac caottgggca ccagagggta tgattttgat 35880 
gtoatttgtt tagctgctta ggaaagocac ccactgccag cttaagattc aggggaagtt 35940 
gagatcaggt gcagccccac ctcctgacat ggggaaaact oattgtcctc actcattcat 36000 
catgcaggca acctggggag aatttccacg gttgcttttg gactgcccta tttgtgatac 36060 
cagagctcta tgaoatttca ttaattttaa ttttactgag gctgaatttg actcactcat 36120 
gcttgaaagg tagggtgtat aacaoatcac taaacaagca aatgaggctg tggaccacca 36180 
gcagtcacat ctcaactaga cttctgttca ggtatttatc agcacatatg catgctcact 36240 
cacaaaatga gtagccacaa aggggtgcta attcotaatg atggtggtgg ccgttaagag 36300 
tgaaattggt agttcttgga cagaaaagag agttctggat aaatgacaaa gaagaaaaat 36360 
attaacacat ctaaaagttt gcatgtgaat acagcacttg gtaaaagtaa aggtgatttg 36420 
tgaagctgtt ttaataattt cttgaaccaa ttcaaaaatt gctcatcaac ttaaaaatta 36480 
gacaaatgaa tgatgaaaog gtggttaaea gagtgttaao aggtcactga ctcatttaag 36540 
aatccatgca ggccaggtgc agtggctcat gcctgtaatt tcagcacttt gggaagccac 36600 
agtgggtgga tcacgaggtc aggagttcaa gaccagcctg gocaacatgg tgaaacccca 36660 
tctctactaa aaatacaaaa attagcoggg cacgatggcg gttgcctgta atcccagcta 36720 
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gaagacctgg ttgggggaaa gataattgag atagtgttat tagaagaaga aatattttta 38580 

gggtaggcaa aaacaacaga tgatgactaa aacctcaaga gagtctctcc actcccttgg 38640 

ccctcatcag ggtcatgcac ctgaggaatg attcatgggt gactctatcc cctccctggt 38700 

ctccattcat agccatgtgt ggaggcagag atgaccatgc tggagaagag accaccccag 38760 

aagacacagc taaaataaag gcaaatttgt tgtgccattc ttggcagggc catctctctt 38820 

gtttattatc ctggagccct cttcccctcc tctgttttgc tgaccttaat aaatgaatac 38880 

atcaatatat gtttggaaat acatctgttt ctacatacct tatgccttcc ctccaaagaa 38940 

taaattaagt gattagtaga atataagagc tgaaaatgtg agagctatct gctcagaatc 39000 

cagataaaac agaatggggc ttatgaaaag tagtgcttga ttagttagtt tcaaaaatcc 39060 

taaaacagtg ttttcaaatg ttcatttctt tgaaacagta gttaaggttt tgatactctc 39120 

ccctgaggaa gtcatctaaa actgctgata aagcttttgg caaagagctt ttcattacgg 39180 

ggctatttat aacaacagaa aaccgtaaac aacttaaata tccagcattg aaagaacagt 39240 

taaataatag atggtaaatc cacacaaata atattatgtt ccattaaaaa ctattgacaa 39300 

taaaattgta atagtataaa aaaatctcaa catgaataaa aagcagaata gaaaattata 39360 

tatgcaccat aaactccagc tatataaaaa agtgaagcct cacattaaag acagggatga 39420 

gaaggatcat ttaataatgt aaataagagt tgttttgatg gtgataaaac atataagatt 39480 

gtcatctccc aactttctac tttcctctag tttttatatt ttgagtaaga ataatttaca 39540 

caaataactt acccatgttt agttctcagt cttcatcttg gttaagctgt cagcagaatt 39600 

tgacacatct gatcactccc tcctccctga aacaacctct ctgccctccc cgtctcctct 39660 

agtcttcata taataataca atataataaa ggtatctatt tatgtgtttg tctgacacac 39720 

aatactgtga attcctttac agcaggtact atgctatact catcttcatc tgttttttct 39780 

ctccttaccc agattgaatt ccacagtcat ttttataagt ctcttttgca catcctcaac 39840 

ttccttcacc ctgcctcttt tcatgatatt actggcaaaa ctacaaccct ggtaaaatcc 39900 

tgctttttct ctctttgtac ccacactggc gaacaaagct ggaaacaaac acacacatgc 39960 

acacacacac cctacctact ctccttgctt acttacttac ttactttatt tatctatcta 40020 

tttattttag agacagcatc tcactctgtt gcccaggctg gaatgcagtg gtgccatcat 40080 

agctcactgc aacctggaac ttctgggctc tagtgatcct cccgcctcag cctcccaagc 40140 

agctaggact atggtcatgc agtaccacaa tcagctcatt tttgtatttt ttatttttgt 40200 

agacataagg tctctctctc cttactttaa taacctagaa ctcaaataag tcacagattc 40260 

tacccagcaa ccatatgctt attcccttac ggttcattga attctccaaa aattatcatt 40320 

agaatttttt aaatattcgc aattttcaca ggataagaaa gcagtatttc atcctacatc 40380 
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tcgcactgcc otgaactctt tottactttc tctctcttct aactttttag cacctcctct 40440 
ttcatcctct cctcaaccag atgactttgt ttctactcca ctgagaaagc tgaagcaaca 40500 
gaagggaatg tcctcactcc ctggcctctc tatacctttg ctgccaatat cagcattcat 40560 
atcccctgcc tctcacctga aaagtatgag taaactctcc atgtgacaag gttaaaccat 40620 
cttotcatgt tttagacacc accacctctt acctactcta gatcttcgct ccaaaaatct 40680 
tcccccaacc ccgattaagt tttctctttc tgctgattat taaaatccac atttttcatg 40740 
ctgttctttc tcaaacactg aaaacacaaa cgaaaaagaa gaagcttatt catcccactt 40800 
ttcootocat ctcgcactgc atttctttac cacccattag aggaaaactt cttgcagtaa 40860 
ttgtccatac tcactgtagt caattcctct tctcccattc coattcaaat tgtotagoca 40920 
ggcttctgcc agtaaagtgg ctctggttta agtacccttc agtaactttc atattctaaa 40980 
ccgagggtcg attatttgaa cggacagtag tattcaacac aactggccac tccctcttct 41040 
tgaaaatact tttttacttg gtgtctagcc accaaactct cctggtctgc ttcgatccta 41100 
tgaattgctt cttctcagtc tcttttattg gttccttoto ttatttctga ctttattaat 41160 
atttctcata ttaatagagt coctacccct cagatcaatg cttggatett cttctgtact 41220 
tagggacatc atcatgtctt ggggctttta aagtgtctct acacggatga cccctgaagt 41280 
gatatctcct tccaggagtt ctcttttgaa ctcaagtttc cttagtccaa cacagactat 41340 
gtttactgcc tatatcccac caccggaatg gaaactctgt ccaaggaaga tttttatctg 41400 
ttttattcao tgcacctaga aaggtgcctg goacattacg gacatttaac aagtgttttt 41460 
gaacgaaagg ataaataaat aaatgggaaa ttggagcatt atggttgaag gccaggagag 41520 
agcatttttg gaggcataaa gagccaaaat caaatcaaag catcttgaac aagaaattca 41580 
taaaatcttg ccttttgtgt actatagtat cttcataaaa tcaattaggc aataccttct 41640 
ttcactattc tctagaagat tttgtaaaag accaaaatga gatgtttctt gaatgcctgg 41700 
tggaattcac aggtaaaaca atctgggcct gatgctttct ttctaggaat attttacact 41760 
aacagattcc aottctttag ggttacaaaa acaatcaggt tttctatttc ttotcaaatt 41820 
atttttggca agttattttt taggaatgtg tgcattttat taaagcattc aatttactga 41880 
cataaatgtg tcaatagtgt tatactattt ttttctgtct ctactgtacc tatagttgaa 41940 
tgtctgattt ttacatcaaa tagtttgttt ttggcttctt ttttttcttg taatttabat 42000 
cacaaataot ttttcagctg gcttgattga tcctttctat tatatttggg gtttttattt 42060 
gttccotgct tttgtcttct tcatctcctt toctctgatt taaattgatt tatataccat 42120 
tattcttttt ctaacatctt tttattttaa ttgaggtaaa atttatataa catatcattc 42180 
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atcaaattaa ccaagtgtac aattaagcgg cagttagtac attcaaaatg tggtcaacta 42240 
taagatctat ctagctccaa aacccaaaag gaaacttaat aggaaacctc agaaccatta 42300 
aacagtcact tccccacacc ccccatccag ccctagcaac catgagtctg tctatgaatt 42360 
tgcccattat gaatatttca tataaatgga atcatggtag aatatgtggc tttttgtttc 42420 
tggcttcttt cactttgcat aagatgttta aagttcatcc atattgcagc atgtatcaat 42480 
acttcatttt tttatggctg aaaaataata ctccattatt acatattatg ttttatttat 42540 
cagctcatcc actgatggac atttgggttg tttccagctt ttggctattg ggaatgatgc 42600 
tgctattaac aatcagacat aaatatttgt ttgaaacctg gtttctattc ttttaactat 42660 
atacctagga gtaaaattgc tgggtcatat gataattcta tgtttaactt tttaagtaaa 42720 
tacaaaattg ttttctacaa tgactgcatc tttttatatt ctcatcagca atgtacaaga 42780 
gttctaacat ctctacatcc ttgccagcac ttgttatttt ccttttttaa attataatca 42840 
tcctagtggc tgtgaaatgg tatctctgtg gttttggttt gcattttcct aataattaat 42900 
gatggtgagc attttttcat gtgcttgtta accatttgtg tatcatcttc agagaaatgt 42960 
cacttaagtc ctttgttcca ttttttagtt gcatgtccca cagggtattc ccttgatgta 43020 
gtactctccc cacttttcct agggatgtgg cttcctgaga gctgagctct agtgattgtt 43080 
atctctcttt tggatctagc caccgagcag gtctaccagg ctccaggctg gtactgccag 43140 
agtctgccca gagtcctgtg atgtgaacca tctgtgggtc tctcagccat ggatactagc 43200 
acctgctctg gtgcaggtgg caggggggtg aaatggactc tgtgaagttc cttatttttg 43260 
gttgtttaat gcattatttt tgtgcttgtt ggcctctgcc aggaggtggt gtttttaaga 43320 
gagcatcagc tgtggtagta tggggaggat caggtggtgg gcagggccct agaacaccca 43380 
taagtatacg ccgtttgtct tcagctagca gggtgagtag gaaagaacca ttagttgggg 43440 
acagggctaa gcatgtctga gctcagactc tccttgggaa gtcttagtgc agctgctgtg 43500 
ggtaattatg gtgtggttcc caggtcaatg gagttatgtt cctgggagga ttatgcctgc 43560 
ctctgctgtg tcatgcacat tgtcagggaa gttggggaaa gctggcagtc acaggcctca 43620 
cccagctccc aagcaaccca aaaggccggt ctcactccca ccatgccccg ccaacagcac 43680 
tgagtttgtt tccaggcagt gggtgagcag ggctgagaac ctgtcccagg ctacctgcct 43740 
cccagctgtg aaagcaagta gagcttcagt acttccccac ctgtggagtc tgcataccag 43800 
atacacaccc tcccccaagt tctgtccagg agacttctag tttggtagca attgttacga 43860 
agttcagctg gaggtttcct tctccttgta gccttttccc agtgcctctg gcagcactcc 43920 
ccaaggaccc ctgtgaggca aggcagaaat ggcttgttag gggacccaca gagcccacag 43980 
ggctttcctg ctgcttcctc tacccctgta tttcacttgg ctctctaaat tgacttagct 44040 
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ccaggtatgg 


tcagaatatt 


ctcctgtcat 


ctagaccttt 


aggttaccta 


gtgacggtgt 


44100 


gtgtttgggg 


gtggacaatc 


tccctttccc 


actttcacag 


tttgggcact 


cacaggattt 


44160 


ggggtgtctc 


ctaggtcttg 


caggaacaat 


ctgcttcctg 


cagagggtct 


gtgggttctc 


44220 


tcggctttcc 


tgatttattc 


ctgcagtcat 


tctggagcaa 


gagttcatga 


agcaagactc 


44280 


cacatgctgc 


tctgtccatc 


taagtgacag 


ctgcaatcta 


gtcctgcctc 


ctgtccaccg 


44340 


tgagcctctg 


taaatcctgg 


atatggaatt 


cttgattgac 


aggattattt 


ttctttcagc 


44400 


accttaaata 


tgttatccca 


ttgccttatg 


gcctccataa 


ttcctgaaga 


gaaacctact 


44460 


gtcaatctta 


tggaggcact 


cttgtacatg 


acaagtcatt 


tcttttttgt 


tacttttaag 


44520 


attgtctctt 


agtctttggt 


ctttcagaaa 


tttagtaaca 


tggtgtcttg 


gtatggatat 


44580 


ctgtaagttt 


ctccttggag 


ttcatcaggc 


tttttgcatg 


tgtagatttg 


tgtcatttgg 


44640 


gaaggttttg 


gccattattt 


ctttaaatat 


tctttctgtc 


cctttatctc 


tctgccttcc 


44700 


tctaagatac 


ccagaggtga 


aagctcaggg 


tcttctcatg 


ttttttttga 


gagtttattc 


44760 


agtcctgagt 


atgcatttga 


tctctttgat 


ttacttgtac 


atattagtgc 


ttttgaaaat 


44820 


tgttattccc 


ccatatatct 


tctttcccaa 


cctcattctt 


cctaggattc 


,ttgatctatt 


44880 


gtttgtcctg 


actgttatca 


cctgctccaa 


gctgctgtgg 


acaattcctt 


tggctttaaa 


44940 


tattttttag 


caaatgtggt 


tcaggaagct 


ctatgaccct 


gggaatattc 


taagttaggt 


45000 


aaaacaaagg 


cacactttgt 


actgatcctt 


cagggaccct 


gggaatattc 


taagttaggt 


45060 


aaaacaaagg 


cacactttgc 


actgatcctt 


cagggagctg 
« 


ccagacagtc 


aaaacacaca 


45120 


accaaaactc 


ttcaagaata 


aagtctatac 


tgtcccctgt 


gacattagca 


aactggaaca 


45180 


gtattttcat 


ggccaccatt 


actctgggga 


agaggggtga 


caagcaggca 


attttaaatt 


45240 


ctctaacacc 


ctcttaccac 


aatgcagcag 


cttctttctt 


cattaagctt 


cccctggttg 


45300 


ttgttagttt 


ttcactgtgt 


ttcctagttc 


tgtgaaagtt 


gattctacca 


gtttttgcca 


45360 


gcatttgttg 


ctggatgtat 


ggaaccttag 


agttccctac 


tctgccattg 


ttggcaaaat 


45420 


caattgcacc 


tttttttaac 


ttgtcaaatt 


agttgcttag 


atagtttatt 


ttcactttct 


45480 


tcttctctaa 


tataagcagt 


taggaaccat 


aagtttttct 


caaggatttt 


ttagtttcat 


45540 






^1 ^ 4* s ^ ^ n ^4*> 

gCbdCuCuCu 




CCuuuCCCCC 


ctccaggcuu 


45600 


99gggtgcat 


gtgaatgttt 


gttacatagg 


taaactcatg 


tcacaagggt 


ttgttgtata 


45660 


gattatttca 


tcacccagat 


attaagccca 


gtacccaata 


gttatctttt 


ctgctcctct 


45720 


ccctcctccc 


accctccact 


ttcaagtaga 


ctacagtgtc 


ttgtgtctcc 


ttctttgtgt 


45780 


tcataagttc 


tcatcattta 


gctcacactt 


gtaagtggga 


acatgcggta 


tttcgttttc 


45840 
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tgttcctgca ttagtttgct aagcataata tcctccagct tcatccatgt tcccacaaaa 45900 
gacatgatct cattcttttt atgattgcat agtatcccat gatgtatatg tactacattt 45960 
gcttcatcca atctgtcatt gatgggcatt taggttgatt ctatgtcttt gctcttgtga 46020 
atagtactgc aatgaacatt tgtatgcatg tgtctttatg gtagaatgat ttatatttct 46080 
ctgggtatat atccagtaat ggaaatgctg ggtcaaatgg tagccctgct ttcatctctt 46140 
tgaatggcaa tactgcattc cacaatggtt tacactccca ccaacagtgt ataattcttc 46200 
ccttttctct gcaacgtcac cagcatctgt tattttttga ctttttagta atagccattc 46260 
tgatggatgt gagatggtat ttcactgtga ttttgatttg tatttctcta ataatcagtg 46320 
atattgagct ttatttcatg tgcttgttgg ctgcatgtgt cttcttttga aaagtgtctg 46380 
ttcatgtctt ttgcccactt tttaatgggg ttgtttttct cttgtaaatt tgtttaagtt 46440 
tcttataagt gctagatatt aaaactttgt caaatgagag tttgaaatat tttctcctat 46500 
tctgtagggt gtctttttac tctgttgata gtttcttttg cgtgcagaaa ctcttaagtt 46560 
taattagatc ccacttgtca attgttgatt ttgttgtgat tccttttggt gtctttatca 46620 
tgaaatcttt gtcccttcct gtatccacga tggtattact tacgttgtct tccgggtttt 46680 
tacagttttg ggttttacat ttaaatcttt aatccatctt gagtttattt ttgtatgtgg 46740 
tgtaaggaag gggtccagct tccatcttct gcctatggct tgccagttat ctcagcacta 46800 
tttattgaat caggagtctt ttccccatga cttgttttct ctctctctct gtcttctttt 46860 
ttttttttgt agatggagtc tcactctgtc acccaggctg gcgtgcagtg gcatgatctt 46920 
ggcgcactgc aacctccgcc tcctgggttc aagcaattct ctgtctcagt ctcccgagta 46980 
gctgggatta caggcaccca ctaccacgcc tggctaattt ttgtattttt agtacagacg 47040 
gggtttaacc atcttggcca ggctggtctt gaactcctaa cctcgtatca gctttgttga 47100 
agattagatc atgtgttatt ttcattacta tctcattatg atcaacaaat ctataatttt 47160 
tattatgatt tctttttggg caactgttac ttagaagtgt gctttttaat tttcaaaagt 47220 
accaggcttg tctagttata tttttgttat cattttctag cttactttcc tgtatgtttt 47280 
agtttttttg aatcagatga aatgacatat ggtctaatat ataataatct gatactcttc 47340 
tatgtaccct tgaatacatg tgttcagccg atttgagaag tattagtttg tatgtttaca 47400 
ttaggtcaag tttgtttgct gtgtagttaa aatctgctct atccttgctg attattatta 47460 
tgtattccat ctttaaataa ataagagaac ttgctaaaat ctgaagttct ggtggtggat 47520 
ttatcttttt ctcccatagt tctattattt tcttacttta tttattctaa tgggcttaga 47580 
attgctatat cttcctggtg aatttaacta gtattcaata tagagtatct ctctttatct 47640 
ctagtaatgc ttttagcatt aaaaagattt ttatcttaca ttgatatcac tacaggattt 47700 
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ttcttaatta atatttgtct ggtataccat tttattgttg ttttaaaatt ttatgttttt 47760 
atgttctagg tgcttccttt ttcttctttt ttgagagcaa gtctacatgt gcctaacctt 47820 
gtgtctggta gcagtgggcc tcaggataaa aggggcagga agagcagagg aaatgggaag 47880 
gaccccaatt ttgctgggtg actttctggc acctagcagg gattcaatag atgtccattg 47940 
aactgtcaaa atgaatgcag atggctggac ttccttctag tgfctaccctt ggactcagat 48000 

gaagtctccc cattccttct tcctgtttga aacactagtt gtcccattgg gaatccctca 48060 

tccccaaggc cttctgctca tgatgaataa tcaatcaggt ttgtgattca aaacaacatc 48120 

catcatctct gtaaaaccag ttccccatcc taaagctatg acattgtttg acaagggtat 48180 

gtattattta aaaccaggac tttgtcatgt tttcattatc tgagtacagt tggccctttg 48240 

tatccatggg ttctgcattt atgaattcaa ccaattgtga attgaaaata ttcagaaaaa 48300 

aatgcatcta tactgaacat gttcacactt tctccttgtc ataattttct aaacaatata 48360 

gtataacaac aatttatata gaacttacat tgtattagga attaaaagta atatagagat 48420 

aatttaagct tggttaatat atgcaaacac agcatcattt tatttcaggg attttggtat 48480 

ccatggaagg ctatccatgg aaaccaataa ccccatatat acaaagggac aactatattg 48540 

gcatccttta ctgattatag aacattccta gtcactacca tagtgcctct ttcacattct 48600 

ctttattatc ttcttctgga actccagtgg gatttttgtt ggactttctc atcctattca 48660 

ttatgcctct tcacttctct ttggtctatc tctttgtctc tttctattgc tttttgagta 48720 

atttatttgg tccaatcatc aagtttacta tttctctttt tattggttcc aaccttataa 48780 

tcaacatatt cattgagatt ctaactttat tacaattatt ttttcatttc taaaagttct 48840 

actttgttct taaatttatc tagtcaatta gagaattcct ctttcatcaa acttgcaatt 48900 

ttctgcttaa tttctttcaa tatattatct attaatttat ttatttagag atagagtttt 48960 

gctctgtcac cagggctgaa gtgcactggc atgatcacag ctcactgcag acttgaactc 49020 

ctaggctcaa gccatcctcc catctcagcc tcccaagtag ctggaactat aggtgtgcac 49080 

caccaagccc agctaatttt tttttctgga gatagggtct cactctgttg cccaggctgg 49140 

tcttaaactc ctggtcacaa acagtcctcc tgccttggtc tcccaaattg ctgggattac 49200 

aggtgtgagc caccatctcc ggcctatttt taaatattat gtaatactgt ttatctgata 49260 

ttcccaatat atgcaatctt tgcacatctg atttctgtgt gtacacaggt gttcatgttt 49320 

atacatttga gttttccaaa agccatagta cactcatgag aaaatgaaag tgaaaagaca 49380 

aatacagtct tagtattaat tataaaaata gtttttgcct aatggattca gagtcttagt 49440 

gaccctcaga ggtcctcaga ctacacactc agaaccactt cactaaagca tccatcgata 49500 
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caatcagtta tttctttcct gtgcctccaa aatgtttcta cctcaatacc aaccttggaa 49560 

aaattgagaa aatgaaagct caaattattt tccatggagc ttaattttct ctgcaattct 49620 

gatcaaaaaa gattgtaagc aatgaaagaa tcaaactagc tgtattcagt agtcctccct 49660 

tatccatagt ttcactttcc atggtttcag ttaatagtgc aatagagtaa gatattttga 49740 

gagagaatga ccccatattc acataacttt tattacagca tatatttatc attatcatat 49800 

ggtatcagta tattattgtt gttcatttct ttatgtgcct gatttataaa ttaaatttta 49860 

tcatagttat gtatgtatgt ataaaagaaa acatggtata tacagagttc agtactatct 49920 

gcagttggaa ctcttcaaac atatccctca cagataaggg gagacctgct gtaatatttt 49980 

tgcttcttaa ccaagtgttc tccctccacc catccctgtc ttagcatttc tatcattaca 50040 

taattctcta tttggaaaac tgtatcatga ggtttttact gccacaatgc aggttaatta 50X00 

ggaaatgaga aaaacaaagt agcctatttc ttggtcttga gtttcaagtg atttagaagg 50160 

aaaaatgaaa accaggctac ctggatccaa gaatgggaaa gagtgaggag tatgcaggga 50220 

ggaggagaca agagcttaaa catggaggca tttctggctg ggtgcggtga ctcttgcctg 50280 

taatcccagc actttgggag gccgaggcag gtggatcacg aggtcaagag atcgagacca 50340 

ttctggccaa cgtggtgaaa ccccgtgtct attaaaaata caaaaattag ctgtgcatgg 50400 

tggaacgcac ctgcagtcct agctactcag gagactgagg caggggagtc acttgaacct 50460 

gggaggtgga ggttgcagtg agccgagact gcgccactgc actccagcct ggcgacagag 50520 

caagactcca ccaaaaaaac aaaaaaaaaa aaaaaaaaga ggcatttcta agagagagac 50580 

cttaacccta aggcagacac tgaagaggat ttggggcatg aaataggagc tgcaggttgg 50640 

agaacatcac aagcaagggc atctgtaacc caccctaccc actttctcaa gagacagtcc 50700 

agagagcgag taaggcacag gggccctctt ctgcatattg aaagtgagaa gaaatgccta 50760 

tgcgagtgtc taggcagaga gcttgaaaca gccacttgtg ggtctctgca cctgcagcct 50820 

ggtgcaggga tcagaagaat gtacttattc tgtgcaacaa gagccatgcc tttgtgacaa 50880 

agcccgtggc ccaagaaggc cctgcatttg gaacaaggag gcaccacttt gatctgctga 50940 

cagctaagtg caaggtggaa tcagagatat cttaatagat gtcaatgata acagatgata 51000 

ccaagaacca ggcatcttag acacacacac acatacacac acacacctcac gtgcacgagc 51060 

acccacgcac gcatgtgact ggataccgca tgagttttct aaagcttaaa gatgactaca 51120 

aggacaaaga gtaaacactt aattgactgc aattaagttt ttgatatcca gcagagtagg 51180 

agtttttact agcaattaac ttcagtttta gaacacgaca aatcttattt ttattataca 51240 

actacaaaca aatatataat gaatgctcag attccaggac cctatacctg gggtgatggg 51300 

tgggacgagt taaagaaggg atctggacac tgctgcctct ctgccttctt tatatgggga 51360 
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ccgtatcaca aatgaggctc actgtgcctc cagtcatgtt tactgtgcag caaggctttt 51420 
ggccagagaa cataattgac atgcacagag ctggagctct tgccagctca cagcaaaagg 51480 
gcctctaatt cagagaaaaa ggagttggac ctctgccagt catgtacctg ttatgctttc 51540 
atataggcat cagttagacc ctgatctcag tctgacaaac cagaaaatat aaaaccatca 51600 
ggatcaaggc atacttttta ctgggagtgt tagcctggta atggagagga atgttattgc 51660 
tctaagcatt cgcataatat ctattgatca aacctctgtg aagctgctat tcttcatgcc 51720 
agcgtctggc ggcaagggag gactatttca ggtctcacca tcacagccac ttcacactca 51780 
ggccatgcca cagtcaccag cacagcttgg gattggggca ggaagtcccc agggtgacca 51840 

gacagaaagc agcctcagga cggggccaga gcatgagact aggggtggtg gcagctgggt 51900 

tttatttact ctgggctgtg tgaccttggg caagctgcca gcccattctg ttcctctctg 51960 

catgccatac aaccgggaat tcctacatta ctgagtcact cctgggccaa ggtttgatgt 52020 

ccctattaaa atgctgttga gagagtaagt ggcttcagtc actagccctg gagaatgagt 52080 

tcaccagttt gatttgttta caactgagaa ctctttcagt ttgtgtgcgt gaggtgctgg 52140 

gggagcaggt ggaagaggct ttgctggagt gggggaaata tcctgcctgg gagtgggatt 52200 

ggcggaaggg ggcatatttg tcaatcacaa tagaatccta gacacttatg tctgggaggc 52260 

atagcatagg ccattgcatt catcttcccc ccagtacctg aatctgtgac caatggaact 52320 

gaggatcttt acttaacaac ctctagtgat gggataatca cctccttcta aggcaagcta 52380 

cttggaaact ctaagatatt actgagaaaa gaccctgagc ccagccttga attagtccaa 52440 

agtctaatgc ttctcccagg tgacagccct tgaaagagtg tgtgacaggc gccatgcccc 52500 

ctgaatggtc tcttctccca aataaataat acctgattct ccctctgcaa tgttcatgac 52560 

ccaaacaaag ctgagcctct gtgtggccac tctcagtttg atgttgtacc ccaaaccttc 52620 

aacctcagtc ttaatgcctg ggaatggggg aatgtgtgga aaaaggcatg aaacacagta 52680 

cacagcaaaa tgtcttaaca tttctcttga ttctctagct ccttatccct tttcctaaaa 52740 

atctctttct aaatctttca aggataaagg gaagggggta gaaaggggaa gtgaggagag 52800 

gtaaaggaag atgttcatta ttgggagcct ataatgttcc aaacacgtgg gacatttaat 52860 

cttcacaaag taagtatacc caccctaatt ttataaggta gaaactaaga cccaagatat 52920 

ctaagtaagt tgcccaaagt cacaaagcaa ttacctggca aacaatggat tcagataatg 52980 

actatatcat aggccaagcc caattacact caaaataatt cacatcttcc taagaggcaa 53040 

gccctggggg cccaagaggg ggttgttgag taagggcaga aagtacaagg gagacaagtt 53100 

gccaatcaca gcttggccct gattattgta ataactgaca caattaaagt tcaatttcgt 53160 
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atccttagaa ggtttccgtc atattccaag gctgatgaga aatttccatt ctaagaatgg 53220 
ggatctgttg aatgttttaa attcaggcac ttaattaaat cttttaagat gactctttct 53280 
gtgtttgccc ttagtctttt atttctatcc aatattgttc ctggaccaaa tagggtcggg 53340 
ctgctgtttc ttgtagccca ataatgagat gcagatgaac tggggaggaa gagagttttt 53400 
tttttaagta tataaaaaca tttattcatt agaaaacaag gagactggca aacatatatt 53460 
ccaaagtgaa agcagctcaa tgcagttcag ttaggctaat ttaagagaaa ggccttgcat 53520 
tttaaagatc gtgtatgtat tttttttttt tttcaaaaaa ggagacaggc aaatattcta 53580 
caaggggaac agaattagaa ttctaggtca ccctacaagt taccctgcac agggaggaaa 53640 
ggaacaggca agatgacttc tcaggatctg tgcctgcgag ctgatgctct gagaatgggg 53700 
gttattttct tgggtgtcct gtcttctgtc atctaggcta aaaaatcttc ctcacttgac 53760 
tcatcacttg agaagacaac ttttggtttc ttttcggaag ctcgctgctg ggtgttcgga 53820 
tgccgatggg tagtacgacg gggctctggt gtgacgaagt cattgtctga ggctgcagaa 53880 
gccagaggct ggtgcctaca aacagtggat ggtttcttgg ctgttggggc tctctggcta 53940 
cctgcttggt taatctcaag ctcctcaatc ccatccaaac ctctggcatg acaggcctga 54000 
agcttttgct caaattcatc tattatagcc ttgccctcag gcttggcccc atgttgtagc 54060 
ttgcccataa ctgacaaaaa agaactgaag atggactcat ctgacagttt atctccaaaa 54120 
cagtcaaaat tgccaagcat cttgcaaggg ccttgctctg tgaggggcag ctgtgacaca 54180 
cagtaggcca ggtcctgctg cacctgctca gtttaggctg tgtggaaccg ctgacacagc 54240 
ttttccacca ggctttgtct gcttgtcttt ggtgatgatg taggagaaga actgcttcat 54300 
aatggtgtga aaaggcactt cctccacccc cagctcgggg tctgacaggt ggctgatgat 54360 
atctggaagg agactataga ttgcattgcc cttgtggaag agctcattga agaacttctt 54420 
ggccagggca gcaatttgag actcagggtt gatgagcagc atggccatct cactcacctg 54480 
cccctttacc ttcaccatgt ccttgaggat caggtggagt catcaccagc cccactgttt 54540 
tccacactgg ctgagcaggg tcctgaaggt gagcatatag atgaggagtc caggggtcca 54600 
ccagattggg aaagtggatg gccagatccc cagtggcaac gatgagatta gaccggacaa 54660 
tgggaagtgg agacttttct agcatggtga acagaagatg aggctgggag ttgcagaaag 54720 
tggtactgat catgcagaac ttgccaaggg taggtgaagc agctgcagag aggtctgggt 54780 
tgctatagag gcctgagttg ttgtagactt taagcaagag tggaacaaag gcagccagtg 54840 
tctgtttgcc atccaacagt tccatcttgc agaagccgcg gattagttct gcctccgtgt 54900 
catctgctgc cccaaccagc cccagctcct cctccatagt ggtctcgaaa cttgtattct 54960 
tctccttggg atctttggtc ttgtgctcct gttcttcccg gagaactcgg cgctgacaga 55020 
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gctctccact cactgcctgc tccaagtgga ccagctgctg cagagccaca tccccagcca 55080 
gggacaagag gttcatcaac aggaaagtgg ggagcattgt gggagactcc ttcgggtccc 55140 
cctgactggt tctcttctct tctagcttct ccagggcctg ttttgcacag ccctgccata 55200 
tctgggcaca gatcactgtg ggactctctg ccagttggta aatgagggtc actgccacct 55260 
ctttgaatgg gatccagagt gggtctggtg gacaaagcct ttcgggaccg tctcccgcag 55320 
tcactcaaac aacttgtgtt cctgaggcaa ctggaagtgg gggtgacgtt tgcccagaga 55380 
agtctttcta ctgtcagaga tgttggcgat ggcatggcac acctgctggg ccagccggta 55440 
gtcctgtgga aacttcggaa gagagtttta ttttctgcaa ccggtgacgg ggagaaggcc 55500 
tggaaattat tgccagacca acttaaaatt acaaagtttt ccagagctta tataccttcc 55560 
aaactatatg tctacgtgta aggtatgcat tcatctaaag atgtaaatgg ttaacttctt 55620 
ttaatctata accaaggtct gagtcctaaa taccttcctc tggagcctca gtaaatttac 55680 
ttaatctaaa tgggtccagg tgctgggggg attaccctta tcttgtctcc tgctaaatta 55740 
cagaggtttt gggagttcct tcagacctcc aataaacttg tttgtggagg cctaaggagt 55800 
ttccttagac ccccagtgaa acttgtttaa tcctaaatgg gtcctgttaa gaattccttt 55860 
gttattttgt catgccttaa gtcccaggaa aggcctaggt aaaactcttg atgggctttt 55920 
gttacattcc agccttcgta caggggcact ggcttttaat atttaactta accactcagt 55980 
cagtactgaa agagttgtca gtgacacctg gcctgccaca attatgagtg tccagaattt 56040 
tactagatgt ttgggggaag gggaatgaga agcctgaatt aggaatattc tccacgagtc 56100 
tctcacaatc tagttggaaa gactagtaaa ctcaatgaca tattaaatga taagtaaatg 56160 
acaacctgtg gtccttactt catgcctttg cacacactgc tctctctacc tggatgccct 56220 
tctctcacct gatctctcca gcaaagcact actcattcct caagacacaa agctgaacag 56280 
gcactctgtg aagcctgcct agtccgcttg cttccttccc agaagatttt gccaatcatc 56340 
cctcttttgg acaaatatcg tgcaacactg tattttaatt atttcccaac tccctgaaac 56400 
actagaatgt gagcaatttg aagataggaa tggagtttcc ttccttgcat tctcagcaca 56460 
tagcattgtg tctgaaacca aatagtcagc aatcaatagt aattgatgag tgtttaaaga 56520 
agcactacag gaagtggcaa aaatcagggt cctccagggc aacacgatgg ggacttcaca 56580 
gagtgggtgg agcttgtatt gggccttaga ggatgactaa catttggaga tgctgatgct 56640 
gccaaaagta gaaagatcag gaggaataaa agcccagaaa tacgagactg tcttccccag 56700 
tgttttatgg tggtcctaga ttcaaatatc ccatcatgtc acaactttga aatgaggaaa 56760 
atactttcac caagaagttg aatgctattc tgaattacaa acccaggtga gtgctcatca 56820 
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agaacagaag accagccagg tgtggtggtg cacgcctgta gtcccagcta ttcaggaggc 56880 
tgaggtggga gtatcacttg agcccaggag ctcatagtgc gctatgtgga tcggatgttt 56940 
gtactaagtt tggcatcagt atggttacco ctatcccagg agcaggtgac cactagattg 57000 
cctaaggaag catgaaccag cccaagttgg aaacagagca agtcaaaatt cccatgctga 57060 
tcagcagtgg gattatgcca gtgaacagcc actgcaotac aatgtgagca acataaagag 57120 
acgcccctgt ccaaaaaaac aaagaaggaa acctgoaatc cgagcacttt gggaggctgg 5718O 
ggaaggagga ttgcttgaag ccaggagtcc aagaccagcc agggoaacat ggcgagaccc 57240 
tgtctctaca aaaataaaaa taagttaaac ataaataaaa aattctaaat gaattttaaa 57300 
aagaaagaaa aaataataga aggctttggg gatcagggca gcccaaaggg attcctggag 57360 
tttcaaacac aaaatagaca tgagagactt aatctggatg tcagaccagc actagaagaa 57420 
aacctggaca ttttgaagcc cttaaattca taaaactttg acagtttcat gaacatttgt 57480 
tgaaaccttg ccatgtccaa gaaggtcacc taggatcaga acttccaccc atctcctctt 57540 
ctccccctgt ccttttctga ggtctacacc occtttgtag taggtggtca actggacaca 57600 
ggaotaggtt tatcaaaact tgctctcggg aaataaaaat aagacaagaa atacaatata 57660 
cacaagaaca catgttcttg tacctcagtc tctgaatagc ctgtctctag actctgtttg 57720 
atgtaagagt ggttgcttct gaaaaaagaa ctgagtgact aacaatggtt ggagaagggt 57780 
gcctttctta attttagttt ttattatttt tattgaagta tgaagttcaa tocggtgcac 57840 
gcaagtatca gctgatgaat tttcacaaac agaagatacc agtgtaacta gcacccaaat 57900 
caagaaatgt taccatctgc ctctatgccc cctcctacca attcccagta ccacctctto 57960 
cccccaccaa gagtaagccc tgaggcccag acaggcttca cctaatttta tattttattt 58020 
tttcttttta tgaacagctt tattaaaata ttcacataac aatacagttc acccattaaa 58080 
agtataaaac tgagtggctt ttaatatatt cacagatatg tataaccatc accaaagtca 58140 
attttaaaat attttcatca cctcaaaaag aaggcaaata ccttttagct ataactccca 58200 
cctcacatcc tccgtaaccc tagccaactg ccaatctact tcctatgtct atgaaattgc 58260 
cagttotcca oatttcatat aaatggaatc atgtaatata ttatcttttg tgactggttt 58320 
atttoactta acactttcaa ggttcatttt tgttgtagca ogtatccata tttcattttt 58380 
tatggctaaa taatattcta tcatatggat atagcacatt ttgtttatco gttcatcagt 58440 
tgatgattat ttcagtggtt tttacctttt ggctcttagg aataaaatgc tgctataaac 58500 
attcatgtao aactttotgt gtggacagtt ttttctcttg ggtatgtaac tagaagcaga 58560 
attgctgggt catatggtaa cactgtggtt agcattctaa gcaactacag ttttatattc 58620 
ccaccagcag tgtacaaagg tttcaattcc tcctcatcct caccaacatt tgttgtttgg 58680 
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ggattgttca ttacaaatac atgaaacagt ggttaccaga gtcagtcagg gtgggaggga 58740 
ggaatgcaga gatttaggtc aaggatacaa agtagcaaat acgtagaatg atttaattct 58800 
ttacataaat agaaccttac agtatgtgtg cttttgtttc tgacttcttt tgctcagcat 58860 
tagattcagg agattcatca acattgctgc gtatagttgc agacagttta ctatccttat 58920 
tgaatggtgt tctactgtgt gaacatacta cagttgcttt tttcattcca ctagtggtgg 58980 
gcatttgggt catttccagt ttagggctac tatgaatact gctgccttga acagtccagg 59040 
atacatcatt ttgtgagctg cttattttta ttatttgtgc tacctgaatt ttgtattatg 59100 
tgcaggtatt acttttttta aatgtcctaa aaacaccttt aagggctttt aaactaggat 59160 
cagaaactta gctccttccc tactgacaag caggcagaaa aaaaaaataa tagtaatagc 59220 
tagtatttac tgaatgctca ttgagccacc tctttaggga aaaaaatctt ttcatatatg 59280 
tttgctttta atcctccaaa ttactcaaag ccataggtat tctcattatg cccatgttag 59340 
acctgcagaa actgagaccg aaggaggtta aataacttgt gcaaaaccat acaactagca 59400 
agtagggaat cacagtacaa atacaggtct ctgaatataa agcctgtgca ttcaccctcc 59460 
aattacacag cctctgatta cactcagctg ggagttcagg gagcctgcta gacatgggga 59520 
ggagaagctg aaatgagagc acctgaaact gccagctcag tccagctcaa cagctcggat 59580 
tgttctcccc accgcccccc tacttcctgt cctaattaat caaatgctct tgaacatgaa 59640 
agagagtgag attggatttt tgttaagatg aaagaaagga atctgttccc atctggcacc 59700 
ttcaaaactc tcctattcaa gccagcaggg agcatttatg caacctacat ggcagtttaa 59760 
taattcccca agaatgccag gagattaagc aatgtcacaa ccgatctgga tccaccggca 59820 
cttgagaata atggaaagag caggggtcta aggtctagga caggggtcaa ccaaatttct 59880 
ctgtaaaggc ccagatatta aatattttag gctttgcagg tcctatgatc tctgtgtcaa 59940 
ctacccaact ctggcattgt aacatgaaag cagccatgag caaacaggca tggctgtgta 60000 
actaaaactt cttttatggc acagatattt gaatttcata taattttcat gtgcctgggc 60060 
atatatttct tttttaaaaa atattttcta accagtaaaa tatgtaaaag ttattcttag 60120 
tttgaaagct gcacaaaaat ggaggacaaa gtctagcgac ctgaattcaa atcctctctc 60180 
tgccatttgt tatttctgtg gctttcgaca tattacttac attcactaag cttcatttct 60240 
ctcttgagaa aatgagcaga gcaggtgcca tgcattcagg gcctcccttg tggctggcac 60300 
tgagccacat gctttatatg ctttactgaa ttaaatccta caataaccac aaaaggctga 60360 
aattatatga ttgccttctt cagatagtta cgatgattaa gaaattatgt ctataattgt 60420 
gcctgagctg gcctccatag ttcattgagt ttgatgcttc ctgttgtacg caaaaattgc 60480 
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catgttggtt cgatgagagc atgcctgctt tttctactac ctgtggtgtt ttgagtacct 60540 

atatttgtat tgcacttaca gtgaaaatta tctaattata ttattctatt taattacaat 60600 

tgcatcaagc tccaataaac aatccaaatt tcactgaact gaaagttgag tgcacacttt 60660 

gaatagtcac tccttctgtt aggttggtgc aaaagtaatt gccattactt ttagtagcaa 60720 

aaacagcaat tactttcgca ccaacctaat atatatgtgc acagtaggca gtttggctca 60780 

ggtacactta ttcaaaacat tgcattaagt aagataatac atgttaaata cctagcacac 60840 

agcctgggat gcaggtgtcg gccagtgttt gttccccact tccttagtga caaagattag 60900 

gcttagctac tttttgggaa aggtgagtgc caccagaaag atgaggccaa ggccttatca 60960 

gacacactgg aacacagcct tcccaaggaa taaatccaga atattttgtg tatgagttct 61020 

ggctctgaag cccagcctgt tcttacactg accccccagt ttctgccttg atcagccacc 61080 

cactaggctg ggcattgctt cctctcactc tgctatctag tgacccaacc tcggcccaac 61140 

cctccagctt tcttctggcc atgcgactgt tctgctagtc tgctccgcat atctgctcct 61200 

accttccgat gcctcctctg agctttggct ccactcccca ctcagcacaa acaacttact 61260 

ttgccccagt atggggtaca acagctcaga ctcttctaga ggtctaatcc tggacacctc 61320 

gtttgcttta taagcacaat ctggaaagaa gagtttcatt gctcagagcc cctcaggaag 61380 

cagaagagga gagtggtgaa gagagcattc atcctctgga atgaccataa ctggtgtttc 61440 

aatcccagct cccctccata ctaatacttg ggtgagttac tcaccaggtc tatgcttcag 61500 

tttcttcatc tgtaaaatgg aggttgtaaa aatatcatct accccatagg gttatggtga 61560 

tgattaaatt agttcttcta tagaaagtgc ctgcagcagt ggttagcatg tggtaggcac 61620 

tttgtatgtg ttatctgtta tgtctgtttg ctcttactaa catctatatt tctgagtcct 61680 

agacctcccc agagagtgtg aagttcatgg tgtttcatgg gaacagatgt aagcccactt 61740 

gctgaatctc tctctgagct aataattaac aaatacattg cacagtagct taccaggaaa 61800 

gcctatactg gtctgaacaa gaagaaactt gctgaacaat tctaaagctc acttctcaat 61860 

ataacagcag cccttgctca acatgttgcc catatcttca gtaccgtgca gaggtcacag 61920 

ccatcatatt ctgccagctc ttcccctatg ggttgagcaa ccatatgtgt cttcccaacc 61980 

aggacatttt tcatttttag cttcttgttt tgacataatt tcaggtttac agaaaatctt 62040 

caagagtagt acaaataccc ccatgtaccc tgtacacata ttcctgaaat gttaatattt 62100 

ttcatatttg ctttatcatt tcctctatca gtctatctat aatacctacc tatctaccta 62160 

tccatccatc taaaaatgtc ccaggtaatc agcaaacctg ctttaaatgt caccaattat 62220 

cccaagaaca tcctttttag cacaaaaaac cccgaatcat gtattgtatt cagttgccat 62280 

gtacgtttag tgttctttaa cttagaacag ttccttggtc tttctttgta tcttatgaca 62340 
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tcaacatttt taaatagcat aggttagata ttttgtggag tttctctcac tttgggtttg 62400 
tgtgatgttt ccttctgaat agattaaggt tatgtgcttt tgagagttat acaccaggaa 62460 
tgatgctatg ttcctcttag tatatagtat cagcagacac agtatatcaa tttgtcccat 62520 
tactagtgat gttattgtca gatttcbcca ctgtaaagat actatttttc cttttgcaac 62580 
tgacaagtat tttaagagta catagccatc ctgatactcc ttaaactttc actcacagca 62640 
cttaccatac agtgtagatt cctgtctgaa tcaatgctat aaaggtttcc aaatgatgat 62700 
tttctaattt cattatttct tctatactta ttagcagtct ctctacttta aggaagaaat 62760 
ttcccttctc tcccattttt aaatttacat cataggttct tcttttattc tgtgggttat 62820 
aatttgttgc catagttatt tctttttatg cctagattgc tgactggggt cagcaagagc 62880 
cctacaattt ggtgcgtgtt tctgttgaca tgtttctcag catttttggg catgtcctca 62940 
cttgccagca caacatgatg ttccaggctt atcttgtcct tttcccagcc ccaggccagg 63000 
aatcagccat ctccaatgag ccaagtaaaa ccatggcata tttaagagaa agggccacag 63060 
gcataatcat ggcgggtata aactgaaact gacctagaaa tgctgggtca tattcccaat 63120 
ctaccactag atgtccttcc acacctcaat agttcttcac aggaagcagt cctggggcat 63180 
ttggaaatgg tgggatattt ggagttatca caaagactgg ggcagcaggg gttacttctg 63240 
gcatttagta agcacagccc aggcatgcca aatttcctgc acaggaccat ctatcctaac 63300 
acagaattgc cccataagta gcaatcctgt taataaatag tggcccaatc cctgaactaa 63360 
tctatcttgt gttctagtct ctcaaggatt ggcttagttc ttttcacctt atccgccatc 63420 
agttaattta atcaccactg tttccatagt gagaatctaa tatattagac tgatcaagtg 63480 
tgttaatcaa atatattgga ttagctggca tgcattatag caaataaaat agccagtctt 63540 
cattttagat ctagtttaaa gaaagcctct acttcttatg tgtcaatttt tgcagaaaac 63600 
attattctag caagttggca aggagtctag gacttatgtt aagacacacc atctcatccc 63660 
cagttcttgc agctctctgt ctagttattg gccctattca ttgcatcagc accaacaaca 63720 
gctacccagg ttttataggg caagcatttt actaaacatt ttacataaac tatctcattt 63780 
aaccttcaca gtagccctat aaatcaggca ttcttattac aattttacaa agagccctgg 63840 
attccgtgtg gtcagtagca agcccaagat agcacagctg tcatgtagca gagccaacat 63900 
tcaaaatccc acagtcgggc tccagtagcc atgctgtgta ctgcccatgc atgtaagaac 63960 
atacccaaca caatgaaaaa taagaaatga aatagaaagg gcatggaaaa tatcaaatca 64020 
taactgcaga ggaaaaaagt caaatttaag cttagagtta ctgcaaagtt actgcaaccc 64080 
atcatatttt attttgtgct tcctagttag tggctaaaac taagggcaaa actgactaaa 64140 
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tataatttgt tgtaatcttg agggaaatgc acagcaagat gtttcttgat caagaaaact 64200 
gatactacgg agctttatat tcagtacttt gatgaaacag cactaaaatg tcctcaataa 64260 
catctctaca aagaatcagt gattcttaga cctctaagag gagccagtgt gatgcctaaa 64320 
cataggcatg taggctactt gacagacaga ctcagcatag cagtcatcat cacgtagtaa 64380 
tttctagcat cctagtagct ttctcactgg ctttagtttt gaataaatgg gatttttgct 64440 
aagatcttac attatgcaac agaggcaacc acaaagacca cccctagaaa atgaaatgat 64500 
cttgggcccc ttcctccctt acctcctgct tcacactgca ttctttggga agcggatgct 64560 
gaaacagagt caggtataca aaaatctggg gagaagtcac tcctgtgaaa gcaaagggaa 64620 
gaagtaagat tgggctggag gacccatcag agcctgacat taacctgaca gtctctggca 64680 
gctcaggaat cctttaggat ttccatgttg ggtataaatg attagactct tctactataa 64740 
cttagtcatt ggtagaggtt gtaccaagaa tagcatgatc ccagctaaaa agctgaggca 64800 
aaccctgagg aatctaacag ctggaggatg tcagcaaacc atatacctca cagctggaca 64860 
gcaaggtctt tcttgaaggg ggatctgagc agcatatctc tatgtctgcc tcatttccct 64920 
atgtacaagc gtgtacctga ttgcagcaga atgagatgag aaacaaatat ttttttgtta 64980 
caatttacca cttccatctc agcctgctac cccttagctc tccttgggca aagggaaaaa 65040 
gacatttgaa gagcctgcct cttggataag tgaaaatgga actctcctct ccttctcttg 65100 
gatttctaaa catcccagat catttctgtg aattcttttt attaggaaaa tagcatcctt 65160 
taaaagagaa aacaagttgt ctatgtctta tattcagagt cagtttgagc cccatgttgt 65220 
gcaatgatct tgtgcaatga taattacatg ctaccatgcc tcaagaaatt tccccttaag 65280 
tggagtgagg aggagaaata gtggaggctc aatgtcttgt tttaagaaaa cacctgttat 65340 
tgctgaacct gatgcttgtg cttatgagca caggagtttt ttgtggccca gcaataacca 65400 
acaactcagg taggccagtt gcctcattta atgagcttct tggtttctga atacaacaga 65460 
ggcaaagaga gtaagtttta cgaggacttg ggtttcctta attagacaca agctttttat 65520 
ttggttgggg gaactaatgg agaaagggag acactagcgg catttgtaac ttgttaagac 65580 
taattgaaaa tcttaggacc aaagctttct gtgaagagag tgctctaatg agaataaatg 65640 
tgatgattat ttacctaatt gagcagtaat gactgaggtg tgagccactt ccccaggatt 65700 
aataaagagc cggggagttg acagtttaga ccattaggct tagctggtac ttcatcttgg 65760 
cgaaggacac agtgccagaa tcatattaat attgtaaaat gagcccaaaa catagacata 65820 
tacttttgga gaaataattt taattggcta agtacctgac atcaggttgg gccattcagt 65880 
tttatcatta aacaaagttt cacacaagtt cttcaagact tgtgccattt acagtgttct 65940 
cagcattcag tgtttaaaaa agaaaagtag aagagggacc tttatcaata tcaaaaatag 66000 
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ttttttgtgc aaccaaaagc aagcaagcta ttattatata tacacactcg ctttaaaaac 66060 
aacactaaag acattgagtt gagctggaag cattgcatac tctgagatac acagttgata 66120 
ttgttaccca cagtgtccct tcaggaaagc cagaggtcat ctgcaacacc acaattaagg 66180 
ataaaaggac cttatattat cagaaatttc aatgtcttca taaaaattgt gtttatattc 66240 
aatatctccc ttatttagtg tgacaaatgt tcatgtacga gtcatctcaa gaaaaagagt 66300 
tactcagctt gctacgtatt cattattcta tggctctgct atgcagggga aagtgcaaac 66360 
agcaactgag agaaagcagg aatctaagac atgggactca acttgccctt tacacagggt 66420 
gaatcctaca ctagtgacct ttgtgtttta aggaagtttg caaatatgtt taattggaca 66480 
acatttcttc ctttatctag caagcactga gtatcttgcc aggcactatg aaaaaagata 66540 
aaactataaa catgagcaag gcacagccac tgccaatcag aagcttacaa tctagcagag 66600 
atcttaaaat gcagggagct cttttagaaa tctgcttagg gatcaaaath tccaatcaca 66660 
cttttccttc cagcatcttt attctgtaat cattatttta atcttcattg aacgagatga 66720 
ttaatgtggt tccatctgct gtatgaaatc acccatgcta cagtcagcca atcaagcaac 66780 
agcagtaata atgtactact tcatttaatg gaaatttacc aagctgttac tatgtaatat 66840 
cacagtttta gggcctggag aggtagagct gaactaaaaa taataatctc tgttatccca 66900 
tagtatagtg gaataggaaa gaaatacaaa aaaaaataat gtgttcatgt attataggta 66960 
ctataatagt agggtaggaa gatacacaga agtgggaata accaagccaa tctcagtact 67020 
cagaaaaggt gttataaagg aggtaacctt gaattaaacc ttgaggaaat ggatattctt 67080 
caggtagagg tgtataaaga gcttcccaca ccaatggagc agcataaaga aatggcatgg 67140 
cagagtgaac agttgggtat ggccaaggaa actctcaact gcatcttctt cactgcctca 67200 
actgatcact caacgctctc cattctctca gcaaacaccc tgactaacta aactcccatg 67260 
actagcagga tattattaga cttaacaaac ctgtcaattt atatgctttc cctttagaaa 67320 
gacttgactt ttttttagtg aactgcaata gaactaagct caggagatgc aactatgaaa 67380 
gatgaaggaa cattataaaa atctctgcaa tgagattctg ccttagattg ctttgctggt 67440 
atgtcaactg attactccaa tttaaagaaa cccaaactgg aaattataaa agataaattg 67500 
tgagtgtatt ttatgcaaaa aagaatcaaa tcaaatcaaa gtcttggctc tacattaaac 67560 
aaacgacaaa ggcatgtttg tttttatcaa aatgttttag ggtagatttg tatgtatgcg 67620 
ttatttcttt aaccaatgtt ttaatcagtt aaacagctgc tattattaat gtgctggttg 67680 
aaagtgtttt tcttaaagta caaatttaaa atagaatgag aaaagatcag acttgagaag 67740 
aaggaaagtc acagtaggac atttggaatt tattctctag accagaggtc tgcaaactcc 67800 
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agcctacagg ccaaatccat cctgcagctt atttttataa ataaagtttt attggattac 67860 

agccaggccc atccagttac agaattgtct atggttgctt ccactctaca atggcagatt 67920 

tgagtagctg ttacagagac tatacgcaag tgtgcaggag agtttagacc acacctacag 67980 

gaaaggtatg aaatactacc ctccttccta gacacttccc caaacaaaaa acttaagtca 68040 

gtgtgggaac aaacactgtt gccccacaga gcactaataa aggcagataa atgactgaga 68100 

aaaccctatc ccttggggac ggaggccaga aatggcaaca ccaataccat tggaagtctc 68160 

ctacttctag gtaaggacat gaaattctct catatgcaac acccatcaca gatacaaggc 68220 

aggagtttga ttttcatggg gaggaggaat tggaacactg agaatacccc acccctaagg 68280 

cccaggcaca ctggcctata tgggtctgtg gctgaactgg gacaaaagag aagccctaaa 68340 

accaccagta gcaagcattg agtaacaagc agtagcagta ttacagcaaa acaggcacag 68400 

ctgaaagcag agagtggggc attaaaaccc ttcagcgtct cagctactat gctaatagaa 68460 

aatccaaaga aaaatctgaa gcccatagga ggctgaggat gaccctacta acaacaaaac 68520 

tcaagtctag ctagactcct aagcagattg atttaatacc ccactctaac aaccttaaag 68580 

aagtgtgccc atttccatgc ataaatacta tgtatctcaa tatttgatat actctacaca 68640 

aaatgtttgc catttaatca aaatgcatga gactcaaaaa gaaaatgcaa cccactgtta 68700 

acagccaaag caatcaacag aaccaaacag agacatgact ccaatgttgg agctattaga 68760 

cacggacttt aaataaaata attacaatga ataagaaaaa tacattaaaa atacttcatg 68820 

aaagaggata tatggatggc aaagaagcac ctttaaaagt gttcaacagt ttatacacta 68880 

ttgatggaaa tgtaaattag ttcttccact gttggaagca gtctggagag ttctcaaaaa 68940 

acttaactac catttgaatt agcaatccca ttactgggca tatacccaaa cagaagaaaa 69000 

ctgttctccc aaaaggacgc atgcactcat atattcatca tagcactatg tatattcaca 69060 

ataccaaaga catggaatca acctgaatgc caatcaatgg attggattaa aaaatggaat 69120 

actatgcagt cgtagaaaag aatgaaatca tgtctttaca gcaacttgga tgcatctgga 69180 

ggccattatc ctaagtgaat taatgcagaa aeagaaaacc aaatgccaca tgttctcatt 69240 

tataagtggg agttaaacct tggatacaca tggacataaa gatggcaaca gtagagggga 69300 

aaaggaagaa ctagagcaag ggatgaaaat ccagctattg ggcattatgc tcagtacctg 69360 

tgtaatggga tcaatcgtac cccaaatctc agcaccacac aatataccca tgtaataagc 69420 

ctgcacatgt accccctgaa tctaaattaa agttgaaatc atttctaaaa aaagaaaaga 69480 

aaaatgaaca aattgaaaat aaaaagatgt tcaacatgat tagtcattag agaaatgaaa 69540 

aataaagcca tgatgaaata tcactacata tctagtagaa caactaaaag aaaacacact 69600 

gacagtcctg agtgctagaa agtgtgtgca accacagtct ctctcataca ttgttggtgg 69660 
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gagtacacag 


tggtgcagca 


acattggaaa 


actgctgggc 


agttttttat 


gaagttaaac 


69720 


atatatttac 


ttactctgtg 


gcccaacaat 


tccacttctg 


agtatttgtc 


ctagagaaat 


69780 


gaaaatttat 


attcacacaa 


aaacctgtac 


atgaatgttt 


atatcagcct 


tgtttgaaat 

^m m^ 


69840 


aacaaaaaac 


tgaaaataac 


ccaaacatat 
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gaataaacat 


attgtaatac 

mm 


69900 


atttatacaa 


tttaaaacta 


ttcggcaata 


agaacaaaca 


aatgatacac 

^m 


acaccttaga 

m^ 


69960 


agaatttcaa 


aggcattatg 


atgaattatt 


tttttttttt 


tgagatggag 


tctcactctg 


70020 


tttcccaggc 


tggagtgcag 


tggcgcaacc 


tcagctcact 


gcaagctccg 


cctcctgggt 

•V ttv mm 


70080 


tcacgccatt 


ctcctgcctc 


agcctcccga 


gtagctggga 


ttacaggcgc 


cagccaccac 


70140 


gcccggctaa 


tttttttgta 


tttttagtag 


agacggggtt 


tcactgtgtt 


agccaggatg 


70200 


gtctcgatct 


cccgacctcg 


tgatccaccc 


gcctcggcct 


cccaaagtgc 


tgggattaca 


70260 


ggtgtgagcc 


accgtgcccg 


gcctatgatg 


aattttttaa 


agctaatttt 


taaaggttgc 


70320 


atattttgta 


attccattta 


cgtaagattc 


ttaaaatcac 


aaaactataa 


tgataaagaa 

^m mm 


70380 


cacattagtg 


tttgctagga 


aatgggtggg 


tagggagtgt 


ggctataaag 


ggatagcaca 

mm mm mm 


70440 


agggaatttc 


tcgacagttg 


tagaactgtt 


ctctatcccg 


actgtggtgt 


tagtgacatg 


70500 


accttatatg 


tatgtgagat 


taaatttcak 


tgatatatga 


catgcttgta 

^m mm 


aaaattggtg 


70560 


atgtgcaaat 


aabtctatag 


tttagttaac 


agtattatac 


caatgtcaat 

mm 


ttcctggttt 


70620 


tcatcattat 


actgatattg 


accaggtaga 


gcgtacatgg 


gaaacttctg 

mm mm 


tcttattttt 


70680 


gcaactttaa 


tatgggtctt 


aaatgatttc 


aaaataaaca 


ctttaaaaca 


ttctccctac 


70740 


taaataacca 


tgattaatag 


gttaaagttt 


ccaggggaaa 

9am mmm mm 


aaagtggaca 

mm mm mm 


acatagatga 


70800 


acagttgagt 


gatttcacag 


aaagataaaa 


actataagca 

^m 


agattcaagt 

mm mm 


agaaatgtgt 


70860 


gggggaaata 


aatcacaata 


gtaattcaca 


gatttaattc 


taggaatcgt 

W ^ mm 


agactttaca 

mm 


70920 


cagccagagg 


aagaattagg 


taatctgaag 


atagatccat 


aaaaattgcc 


caaagtgaaa 

mm ^mm 


70980 


atcagtaaga 


aaaggacaaa 


gaaagtggat 


gaggagaaca 


aagtatttaa 


aaatgatggg 

mm mm mm mm 


71040 


acaatatcaa 


atagtctaaa 


atatatgtca 


ttgaagttac 


agaaggagaa 

V mm mm ^m 


gagagaatgg 

mm mm mm mm ^m 


7X100 


gcaaaagaaa 


tctctgaatg 


atagctggaa 


ttctccaaaa 


ataatgaaag 


catcaaatca 


71160 


cagacccaaa 


attcttaaat 


aacctgaagc 


agaataaatt 


ttaaaaagct 


agctataact 


71220 


tatccaaagt 


accaaataaa 


ggtaaagaag 


aaattgaagg 


aagacagtgg 


aaaatatatg 


71280 


aacatattaa 


atcagagaaa 


ttcttaaaga 


atacaaaatt 


acagctagat 


agaactagaa 


71340 


taagttctag 


tgttccatat 


tactgtagaa 


ttactatagt 


taacaataat 


atatggtttc 


71400 


acataactaa 


aagaatattg 


catgatcctc 


acaaaaaaat 


gataaatgct 


agagatgata 


71460 
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gatatgccag ttatactctg ataactatat gtactgagac atcaccatgt agccccatga 71520 
atatgtacac ttattttgtc attttaaaaa ttaaattaat ataaagaggt aaagacaaga 71580 
attatagctg acttcttatc tgaagcaatt aaagccagat gataaaacat atcttcaaag 71640 
ccagctgggg ggacaactat cacccagaat tctacatcta gcaaaaatac ctgtttaaaa 71700 
tgaagataca ataaggacta ttcaaaattt ttgtaatggg agaattcctt accagcagac 71760 
ctgtacttta agaaacatta aagaaagttc ttctgacaga ggaatgtgaa accagaccgg 71820 
aaattttatc tacagagaga tgtgaagaac tctgaaaata gtaaaaataa agattaatat 71880 
taaaagccac atttttctta atgttaaatg actctaaaat gtaattgact acttgaagaa 71940 
aaattgtagc agtgagattt ataacccatg taaaatttaa atgtctgata acaatagcac 72000 
aaagtttatg aggaatttga atgtctaagt cttcacaata tgtaagaatt tgtataatat 72060 
tatataaaac atctaatact cagtctttac ttgctgcagg gaagatcaat ttggcaggtc 72120 
cagcactgga attaccattt agattagtta ggaaactgcg gaagttcaag aaagaaatca 72180 
tcagtaccta gagtatgcca aataatagta agaatacaga gaggattcaa atttgagatc 72240 
tacttaataa ggtaaaataa ggaggatgcg gtgactgagt attgagatgt tctgaatgtg 72300 
aggtagaggg aagagtctac tatgtattat gggttcctgg cttgggtgtc cctagcagac 72360 
agtggtgcta tcaagtgagt ctgaggagag aggaggatag gatcattaat atgaaaagtt 72420 
aatgagtttg gattttaact ggggtttcag gcaatttaca tgtaatgtta ttattgatgt 72480 
gtttggattg aaatctacca ctggtttctt gcttttattt gttccatcta tatttttctt 72540 
ccttttttcc ttctatttca gcctggatta attgagcttt ttatgatttc attttgtctc 72600 
cactattggc ttattaatta tgcctctttt tctatttctt ttaatgttgt cctaggcttc 72660 
acaatgtaca tcctctctac attcctacag ttatttaaca tacgtttctt ccttgaactt 72720 
ctgcatagtt tcctaattaa atggtaattc cagtattgga cctaacaaat tgatcctcac 72780 
cacagcaact aaagaaatct atttataata catgtttgga gacataaaca ttcaacaact 72840 
ctcacagtat aaaaaaatac atcctctttt aattggcatg tgagacccag aagtctggat 72900 
tcttcttgtt tctgcagacc catttttgcc acttcctagc tcctatttag gcttcagcaa 72960 
taacaaacag atgcttaata tacacaagcc aagtagtttc tcatctccat aactttgttc 73020 
atgctgttat atgtgcctag aatcaatatt ctgcattcct ttctcacctg accaatcctc 73080 
actgaaaagc tcagctattg ccccattctt gacttcttcc ttgtccaccc gcttttgtgc 73140 
tccaactgga accccattca tgtttacaag aaacactcaa aggacaataa agcctgttgt 73200 
gactctactt cggcttttct gagtattaaa atttcctaag atctttcaca ggacaaattt 73260 
ttaaaagcaa gagtaataca tggaggtgat taagccttta aagtgtaccc tttgttccca 73320 
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aggtcttgtt aattatcatt taggagcagt ttatctgcaa tgtgctcaat tacttaggta 73380 
actaaatagc tttttactga tcagcattga acatcatttg cttaatagca actaacaaat 73440 
gcggtggcag aattccactg atagtctact ttcagaaata ggcttattta agattgagct 73500 
ccaggggaaa aatagattct aacagaatag attattgtta tttacataac cacttaggtc 73560 
tagagaaaca aaataatctt tccgtttttt aaaattaagg aatcagtaag ccaaaagaga 73620 
atatccaagt aactcccaag tccattccgt tcattcatcc agtccatttg tgccatgaaa 73680 
atcttcatga cattcctatt tgatcaatca caactcacag ctctcaatga ctgctcacat 73740 
ctgaccttct cagtaaggac ttttcttata gcttagaaca gtcataggat ttttcctctc 73800 
caagcttttg acactttgga ttatagatcc catgtgcttg ccaatgcttt ttttgttaac 73860 
tgtagaaaaa taaaaagcaa atacagacag gcgaaagact cagagggtca cataacattg 73920 
tatggtgaac tataccatgt gaatggtcaa ggaaagcttc tggagcatgg gacatattag 73980 
ctaggtgttg aaagaaaaat aggaattcaa taggctggaa aatgggggga gggtatttca 74040 
agcagaggga acacctatct agatgcacaa atgtatgatc aatatggtgt tcaacattga 74100 
aatgccacat ggggacctga caagaggtag ttgtattagt ttgttttgfcg ctgctgtgat 74160 
ggaatgccac agactgtata atttataaac agaaatttat taggacacag ttctggaggc 74220 
tgggaagtct aaggtccagg tgccagaaag ttcagtgtct ggttaggtca ttctcttcca 74280 
agatggtgcc ttgcatgctg cctagatcct cacgtggcag aaggccaaag ggagaaagag 74340 
gatgagccca ttcacttttt ataatggcat taatctattc atgagggctc aacctaaatg 74400 
cttcccatta ggccccactg tcacattggg aattaagttt ccaacacacc aattcttggg 74460 
cacacattca aactgtagca gtaggcaaag ccaaatagtg gaaggctttt attctaagct 74520 
gtgggactca gctttattct gttaacagtg aagatgcaga aagggcgaga tgcagtggct 74580 
catacctgta atcctagcac tttgagaggt tgaagtgagt ggatcacttg agcccgggag 74640 
ttcaagacca gcctgggcaa catagtgaga tcccgtctct atgaaaattt ttaaaaagta 74700 
aaagttagct aggtgtggtg tcacgtactg ttcccaacta ctcaggaggc tgaggcaaga 74760 
ggatcacttg atcctgggaa gttgaggctg caataagcca tgatcacacc actgcactcc 74820 
agcctgagtg acagcaatac cctgtaccac ttccccacca aaaaagatgc acaaaaggga 74880 
gatttatctc cctagttttc taaaggagca aaagcaactt ctgtgggtgg aatagcccta 74940 
taagtagaag gggccaaaag ataaaaaggg caaccaaaga agggaatgag gggatgagaa 75000 
gaagacaggg actcatatct gcacattagt ctagggaata aagaagaaaa gacacaacat 75060 
caatctaaaa agaaataaat gaaaacaata aaaggcagtg ttgaaggcag tcgagcactg 75120 
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acttctatag tatgggttaa agcccagctt cacctctaac tttctctgaa atcttaaaat 75180 

gttctaacct ttcttatttt gttcatctat aaaatggaga ttttatctca gtttgttgtg 75240 

agattaaaca agttaatgat tctaactact tagaaccttg ccaaggatat aataactgct 75300 

taataacccc tcacaatcat gttctgtgtt ctagagaagt caggttaagg gtagtaattc 75360 

tcttttaaac catgaagtct acctagagtt atgtaggatg tcaccattgg gggatgcttt 75420 

gcggtaggca catgaaacct ttctgtacca tttattgtaa catcttgtga gtctctaatt 75480 

atttcaaaat aaaagtttta ttttgttttg tttttcttaa agccttttca aaccacaatg 75540 

gcaattcatt caagcatgtt ctcagcactt gctgcatgcc ataccctgag ctccttctgt 75600 

tctaacatta gtaaacctca ttttccaata ttcagaaagc aattaaccaa atgacagtgt 75660 

gactttccat ttttactatt gctctttagc agaaacatag caataaaaat aaactattgg 75720 

aaatgaaatt atgagtatcc tagattaatc cccaaagttg cagtttcttt tgttgttcag 75780 

ctgtttgatt ttctaagagc ctgaacccag ccaagtgcat actctatttc tgagcagagt 75840 

tctgcccagt tctctgtctg actagaattg cacagaaacc acaaatctct cataaagcct 75900 

aatggtgttt ctcaacagag gcatgattgg aattctgagc tggataattt aatactaggg 75960 

tacagcccat gggtagtaca gaaaagagtc aatacttgcc ttgtctttgt acaaccagga 76020 

tgtatgtctg catgacaata atgaagactc aatttcagcc cccattcacc cttccacaac 76080 

agcaggggca gggcacccta gagcataaat aatctgaaca accctgtgtt gggggcctgc 76140 

tgaagatgct taacatccct ggccccatcc actaattgtc tgccatcaat cacaacttga 76200 

cagttacaag taaattgcct gtgttctccc catatgcctg cctttctcat tgtgtaaaaa 76260 

cagctctacc tccatctgct agtcactacc aatcactctt gccaagatgg tgactcctct 76320 

tcttgcctgc tagtccctag acacaaagag tccaaagtgc cctggtggca gccatagctt 76380 

ttagttcaat ggtacctttg ctatgttccc tggcagaagt acacttcctt tggtgaccag 76440 

gacctccaag ttgaagagca cagaggtgca gatgaagaag tgcaaagccc gttagtgggt 76500 

caatgggagt gatgttaact taggccattt cttcctgtac cccttgattc taagacctat 76560 

gtatacttcc tgttagggac ccaacaccat gaagtgggct ctgatttaac acatatgctg 76620 

cactttaaag gatgggaccc taaactttca gagcattacc tttaagtggc acttcatatg 76680 

ttccttcagt agttcattct agggctctaa aagtacagct acttctgtct ggtgaagtat 76740 

gtaatacaaa cagtaggttc taggatcaag agctatccta cacccccttc acaattaagt 76800 

gggtcccctg atcacatact atgttgtggg atttccattc ttgtttggta atctaaaagc 76860 

ccccagataa cagttagctg tgttgactat gactcagaga tcaggaaagg aaagcttata 76920 

cccagaataa gtgcatcatt ggaaggatgg acagctggct gatgcaggac agaggggaac 76980 
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caatgtagtc aacttgccac aaatcaacca gtcgaatctc ctgaagacct agtaacctat 77040 

tgagggctcc ttgttggtct ctcttgttag caagttggac attcaaaatc agcagtagct 77100 

agatttacca ggataattgg tagtccattt tattggctaa tgagtagtct tgctctctgc 77160 

cactgtggcc actcgattca tgtgcccatc atgccagtac ttaatccaaa gacaaagcct 77220 

ggataatgtc aactggacaa gttatcttgt ctatttgttt tttcagtgct tcttgtgtac 77280 

ttgatgcttt ccaattagtg ttaaacgcat gacacaaaaa tcttcacaca ttgtgcccac 77340 

tcccatagtc ccaaccacat gtctctgtct cagactcttg tttccaatct ccaagtccct 77400 

ttcattcagg tccctgtcca cacagctaga ctattggcta ctgtgcagaa acctatacat 77460 

atattcacac cttaggccaa tttttcttcc acacgaatag ataaccatat tcattgcttg 77520 

cagcttcatt cattgggaag accttccttc tccactgtct tccaaggcta ccccataatt 77580 

cagctgtact tcagtgatca tctatttttg ctttacagct acttaaggaa ctaacccatt 77640 

gtcagccagg attggccttt tacctccctt tcagctagtc atagagaacc cccatatagc 77700 

cataagtgtg agctatgcaa gggcactggt acaaccctgg tgggtgacat ggaggcatct 77760 

gtgctacttg ctcatgcagc ctcttcatgc cagtctagtc ccatatgggt ttgatcccag 77820 

atgtgccatt tccattttcc aatagaaggc tgtgggacac gtctgagtgt atgacttagt 77880 

gcattggcaa taatctagct cataatggtc agttccaaat acatttggtg ccccattatc 77940 

agctgttcca tcttgaccaa ggctcagcaa cattctagga agggtttctc aaaaggtgta 78000 

taactttgta ctgaaaatgg atggcattgc ttcagagccc taggggcctg cattctgatt 78060 

ctttcgcttg ggcatccatc aactccaaac tgtatctttc cccactaaca cattcagcac 78120 

catggggfccc accagatcat atgcctcaag tagtagggtt gcttatactg cagcccatac 78180 

atgctgcaga gccctttctt gctttcttgc tccaggccca ctcaaagctg tgagacttct 78240 

atgtagttca gtatatggga ccaaaaatat tcctagatgt ggaatgtgtt gcctccaaaa 78300 

cacaaagagg.cctaccaggt atcttgctct atttattgag attgagaatg taagatacag 78360 

caatttaact tttacttgaa aatgcagtgg tgggagtggt attctagcag actttcacta 78420 

ctgaaccact aaaaacttta ccgatgtggc atgttcctga attttcacag ggtttgtctc 78480 

tcaccctcca gagtgtatgt gtcctaccaa tgcctctaat gtattagaca cttcttgttg 78540 

actcttaatg acatctttga tgtaaatgga tcaatgtgat attctacaga atgttcagat 78600 

ggtccatgtc tcttcagacc ttattatgac agatgtttga agaggtaaca tagccatggg 78660 

caaagcagaa aacgaatatt attgtctgtt tcatgtgaat acagactctg attttttgga 78720 

tgtgtctttg tctggttttg gtatcaggat aatattggcc tcgtagaatg agtttggaag 78780 
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tattccctcc 


tcctctattt 


ttcagaatag 


tttgagtaag 


atgatattag ttcttcatta 


78840 


aatgtttggt 


ataattcagc 


agagaagcca 


tcagttcctg 


aacttttctt tattggaaga 


78900 


cattttatta 


taagttcttg 


ttatttgtta 


ttggtcatag 


ttcaatcttg gtacattgta 


78960 


tgcatctagg 


aatttgttcc 


tttcttctgt 


aatttccaat 


ttatttacaa tttatttatt 


79020 


tgagtcttct 


ctcttcttta 


ttagtctggc 


taaaggtttg 


tcaattttgc atgacatttc 


79080 


aaaaatccaa 


ctttttgttt 


cattgatatt 


ttgtattgtt 


ttcttcatct caatttcatt 


79140 


tatttctgct 


Gtgatattta 


ttatttattt 


tctgctacta 


attttgagtt tgttatgctc 


79200 


ttccttttct 


agttctttaa 


gatgcatcat 


taagttgttt 


atttaaagtt tatcttcttt 


79260 


ttcaatgaag 


gtactaatag 


ccataaactt 


ccctcttagt 


gattttgctg tatcccatag 


79320 


attttggtat 


gttgtgtttc 


tattatcatt 


tgtttcaaga 


aacttttaaa ttttcttctt 


79380 


aatttcttca 


ttgacccacc 


agtcattcag 


gagcatatgg 


tttgatttcc atgtatttgt 


79440 


atagtttcca 


aaattcctct 


tgttattgat 


ttctagtttt 


attccattat ggtcagagat 


79500 


gatgcttcat 


attatattat 


tttattttgt 


tttaatgttt 


tcaggctttt tagtgattta 


79560 


acatatagtc 


tactcttgag 


aatgacccat 


gtgctgtgga 


aaataatgtg tattctacag 


79620 


ctgttgtttg 


gtctatagcg 


cagattaact 


ctcgtgtttc 


tttaccaatt ttctgtctgg 


79680 


aagatctgtc 


caatggtgaa 


attgtggtgt 


tgaagtctcc 


agctattatt gtatggggtc 


79740 


tatatctctc 


tttagctcta 


acaatattta 


atttatatgt 


ctgggtgttc cagtgttggg 


79800 


tgcatatata 


tttacaattg 


ttatattctc 


ttgctgaatt 


gacccctttt tcactatata 


79860 


gtaaccttct 


ttatctcttc 


ctatagtttt 


tgtcttgaaa 


tctattttgc ctgatataag 


79920 


cataactagc 


cctgctcttt 


tgtggtttcc 


attgacatgg 


aatatctttt cccatccctt 


79980 


tattttccgc 


ctacatatgc 


cttatgtctt 


tattggtgaa 


gtgtgtttct tgtaagcaac 


80040 


agatcattga 


atcccttttg 


ttttttcaat 


ccgttcagcc 


acttggtgac ttttgattgg 


80100 


caagtctcag 


agtctcatcc 


aaagtcttca 


atgtacctgg 


gtattgctgc tggttattct 


80160 


gagcccaggg 


actctttagt 


tagcaggtga 


tgaatgttgc 


caggactatg tccttccctt 


80220 


caaggcagta 


gtttcccttc 


tggcctaggg 


catgtctaga 


aatgccatcc aggagctaga 


80280 


gcttggaaag 


gcggcctctc 


agagctgact 


agtgccctct 


cctgttgtga ctgtgctggt 


80340 


atccaagatg 


taagacaaaa 


tccttcctac 


tccttccttc 


tccttcttct cctttcctta 


80400 


agtggaagaa 


agagacctct 


tttggagctg 


tgagaattgc 


agcctggggt taggggaggg 


80460 


gtagtgccag 


aactccctta 


gccacccaag 


ctggtatctc 


agtagtccat gtgcctcccc 


80520 


agtgtactgg 


ctctgggccc 


agttcagaac 


tagaacttgc 


ttgaaagttg cactccttgt 


80580 


ggcctagact 


gactttcaag 


tgtatttaga 


gccctagagc 


actttagctt gcagtggtaa 


80640 
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ggcttgtggg aactcaagtt ctgaccacca ggattggtga tttccttttg gctagagcta 80700 
atataaatgc tgtctccatg gggtggggca tcagctgagt ttggcccagt tttcttttat 80760 
gctataatag gacagcacta agttcaatgc ctcacaattg ctgtgctttt ctcccctggc 80820 
acctaggaat gctctccaca ccctgctgcc attggtgggg gatgggagag gggtggcgtc 80880 
agagatttaa aactgttttt tttttttttc tacttcttca gtgcctcttt cagcaatatg 80940 
aagttaaaac caagcactat gagagctcac ctgatttttg gtgcttacaa agctactttt 81000 
tttgtgtgta gatagttgct aaattagtgt ccttgctgag ggaggaagag ggaacgatca 81060 
gtggaacctt ctattctgca accttgcctg cagcactctt ttgagaaaga ttctggatta 81120 
cgtgataaat tagtgatgtc tgtcatggag gggtgagggg cagtttttag acatatagcc 81180 
tatcttgcca tttctgctat aaacatctgc acagcttgta caatctgtaa ctatagaatt 81240 
gagtacagat gacctgccca caagaggaaa aatgtcctga tcttggcctt agtgcctctg 81300 
taattcaacc aagtatatta attaaatgga tagatcatac acttgctgtt tcccaagtgt 81360 
agaggttgaa agtggaaagg agaagaaagc aaagaaaact gggaggaggg gcactaagac 81420 
tgggcataag gagaaacaag aggtccaatt tgcaaaataa tgggtgtagt aaggggaatt 81480 
ttgaagtatt ctccaaagat aacgattttg cttttctcaa ttttgtcctc ttgattctca 81540 
cacagtgctt tctttagtac agacctacat accccgaaaa cttctgttct gtgacttaaa 81600 
aagtcaactc tatgattagc atgattcctc ttcagtcatt actgaaattc ttcctaggtg 81660 
gttacattta agtggtaaga agcttcagta atataagtca aagaactcct accttgcttt 81720 
gcatttcctc agtggccatc tgataaagca ggaaggaaaa aaattaaaaa tgaacctccc 81780 
caggatttca tttctattgt ggcttagctg caaatttgaa aaataaaaat aaaaataaac 81840 
cctgaatgta agcacttgta cgtttttgtt agattctttc acaggactgg tcaacatctt 81900 
gacagatatt aattggcgat tcactggaga cttcacggca cctgacctgg tttgccgagt 81960 
ggtccgctat ttgcaggtat gtcacacctt ccaaatgtga tgagacaaac tgataccaga 82020 
gttagctttt agatttatct aaggaaaatc atataaaacc ttgcattcct atgcatcaaa 82080 
ctgcccgctt tctcctgggt agaagggcaa tgaaaatttg atttttaatg tatttccttt 82140 
gctaaaaata gtatctatat aaagaaaaca caaacgagcc acaaatgagc aatgctaagg 82200 
acataaagca ttcatatttc aaatgttaaa taaaatgtct gggtcaatat taaggatttt 82260 
ggcatattgt ggataatctc caaaaagttt gttaattata tttacctttc acttctcaaa 82320 
tgaactcaac gctggctact gctctttttt gtaaaccaag gggaatttgt tggaacataa 82380 
ccattaacta ttgaattact gcctaaatta tactttttaa agtatctatt ggcatttgaa 82440 
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tttcttcctc tcaaatttcc tattttcctt tttgaatgat tttataactg agagactttt 82500 
tagtttttct aagatctatt tttagattaa agatattgat gcttcaaatg tcaatctgct 82560 
tcattgtaag ccatatttat gtatcatatg tatatttgat ttaaaatatt acatatatac 82620 
atatttacat atatacatac ttgttaatat gcattagatg tttataatat agatttagac 82680 
acttcaacta agtctggaag gaatggatta tttcataact ggtattaggt ttgatgtgct 82740 
gtttctgagt tcttaagatt attttttatt aatatatttt tattctgatg ctagtacttc 82800 
aggtctaaat tattatagct ttatgtatat ataatgcatt tctagaaagc tttatcctgc 82860 
tagattccat ttgttagtag actcttttag gatctttgta tatatattca taagtaaaat 82920 
tttaattttt ttcattggtt ttattatttt caggtcatgg taatgaagtt attccatatt 82980 
cagaaaatga atatggaata gttacaatct tcttttgtgc tctgaatata accttgaaag 83040 
tttggtatgt ctgaaagagc cttttctatt atgtacggct agtattttct caggggaaat 83100 
ttttaaaaac atatttcaat ttctttctga ggatatttgc ttccttgtaa ataaatgctt 83160 
tcaattgctt attatattag caatcaattg ctgatacaca aattatgaca aaacatggtg 83220 
gcttaaaaca ataaacattt attatctcac agtttctatt agtcaagaat tcaggaacag 83280 
cttggctagg cagatctgga taaaggctta tcaaaaggtt cggtcagatg ctttgtggta 83340 
ctgtagtcat cagaacgctt ggctgggact ggacaagatg gatcactcac ataggtggta 83400 
aattagtact ggctttagtg gaatactcag ttctccacat gggcctctct ataggtacat 83460 
tttagtgtcc tcacaacata gtagctggct tttccaagaa taacctattc aagtgactga 83520 
ggtggaagtg gcaacatatt tatgaactag tctcaaaggt catacaccat cactttcata 83580 
tgattatttt ggttgtacaa atcaacttgg attcagtatg ggaaaagaag aaccacaaaa 83640 
ctaggaagta aggattattt tcgacatgtt ggaggttggc aaccacacta tcttcatatt 83700 
tttccaaagc atttgataac ctatggtttc tcccaattgt taactttaaa atgtcttctt 83760 
gatttagaaa tattcatgtc agtaaaattt tttctaactg tatctacaat aaatgttaag 83820 
actatatata aaatatatct tttgacatag ggaaaacgat agagaaagaa attctgataa 83880 
ttctctcttc tataaaggaa gcaagaaaat tggcaaaaaa tatcagaatc aaatttgtaa 83940 
aactctggaa attaaccaaa cacttgcaat tactcaggga gcatttattc aggaaaatgg 84000 
ctgaatctca gtgagaaaaa tgagctttgc ggtgctttgg cttgcattat tctcagcccc 84060 
cagtatccaa ctctgtaaca gccttaaaaa gtaatagctc tcattcatag tgaaaacaaa 84120 
ttgtctggca gccactgtag aaagcaaaac agggacagag ccctttcaaa ttcccaagaa 84180 
attgccatta tttgacatgt ctggcagttc tctggaagac tccacttgca aggctgtctt 84240 
tat tt tact t aaatcagaac tcacccactg tgaaaaacgt ttctcctctg ggcattcgtc 84300 
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aaaacagtta caggcaattg attaactttg cttatgacaa aggtattgga taatagtcag 84360 
ggaaaacaac agactaacca aagcttaaga acatacgctg agtaatgaga tatccatagg 84420 
ggatttgaaa atctctgaca tactcccaaa aatctagaag accaggtata tgcccaggac 84480 
tatgctcatc ctaaggaaaa aatagagaag gcactgagct gtaacctgtg aatgatcttg 84540 
aggctttgtg caagcaggaa gtgagggcaa agacagaatt gtaagttgcc tggctaagtg 84600 
tcaaaatcat gctccaacat tcacacagac ttcctctgca atgaatgggg gaattaatag 84660 
tttctagaat gtaagaaaat ctctgtttag gacctatcac caagatcatg gagtaggaga 84720 
tattcatccc tccaaagcat tcaagtatag acagctattc acaaactaaa atagcccaga 84780 
gaggactcaa gggcccatta aagaatttgc agcaacatcg tgaaggaaaa aatggagaat 84840 
aaccacatag aaagaattgc tggtgtgatt ggcacaccta agaggccatg atttggctag 84900 
gaacaaagaa gaaaggtgga agctatcagt attagacaca gaaaggacca acattgtctc 84960 
tagtggcctg ctccacagga gacactagca tcttttccca ctgagtaacc aacagtcatt 85020 
cctgcaagag aaccccagag ggggagatgt ggctatacct tcctcctatc ccccaagaag 85080 
cagttgctgt cgagctgctt tgggaatggg gtcaccacct ctcccaacca tgtgtaagct 85140 
ccagccctgg agccatggcc accctgggaa tgcccacact ccagactgat agtctgacta 85200 
cactgggctc acccatgttt ttaacatcac agccagcacc atagtaagct agttaacact 85260 
ccaagcccca gtgttaaact ctgcctgcat gtgtccacac tccagacact gacttaacca 85320 
ccatagagag ctagccccat ccaggcctca gagccattat aattctgtgc atacctgtgc 85380 
tctcattctt gtctccttga ctaatttata cgtatactat taccaacatt acagcagggg 85440 
cacctgtgcc ctgggcaaca gctcagccat agaaagctac gccttgcccc aaccctgaag 85500 
ccactgaaag actgtgttct cagctcccac actactttgc atgtgcactc atgcctcaca 85560 
tactataeca tgcagcagaa ggggcatctg caccttgagc accaatgtca ttaccattct 85620 
aatcccagag ccataatctc tccatgtgag cccatgcatc agacctcagt ttcatggcta 85680 
ctccatgagt gtcacccatc agacactggt gatactgcca ctaagagagg ccctacaagc 85740 
cagatctagt accaagagaa atcttctcag ctccagcttc cctggtgggg ggaaagatca 85800 
gaagaacctt agcagccatc accactgaag accctatcaa ttcttgctgc tactgccaat 85860 
atccactgct ttggccactg aggatcctta tgatctttat caactctgac ctgagctgaa 85920 
aaagctacac agagactaca aatctgcacc atcactgatg ctaccccacc caacaaaccc 85980 
tccctcatag ggcatggtct ttccatagtg aactagttca taaaatctgg aaggtgattc 86040 
tttttggttt tgttgatgtt tatctttccc ttatattgat gtgtgtgcat ttgactgctt 86100 
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taccaaacgc acacacatca acataaggca aaaataaaca taaaaaaaca gacataacat 86160 
catcaaaaga acacagtaat tttacagtaa ctggccccaa tgaaatagag atatatgcaa 86220 
tgcttgacca aaaaaatcaa aataattttt ttaggaaagc ttagtgaact taaagtacag 86280 
agaagcaatt caatgaaatc agtaaaacag tacaaccaaa acaaagaatt aaagagagag 86340 
actgaaatta ttttttaatt aaatagaaat tctggagcta aaaaacatta taaatgaaat 86400 
aataaatgca ctagacagca tccagcagaa attatcaagt agacaaagaa tttgtgaagt 86460 
gaaaaacagg ttatttgaaa atatacaact aaagaaaaag aaagaaaaaa gaatgagaaa 86520 
agatgaagaa agcttaagag atttatgagg cagcatcaaa agagaaaatg tgcaatgcat 86580 
tagtccattc tcatactgct ataagaatgt attcaagact gggtaattta taagggaaag 86640 
agctttagtt gactcacagt tctacagggt tggggaggcc tcaggaaact tataatcatg 86700 
gtggaaggga aagcaaacat gtccttcaca aggtggcagg agagagaaga atgagaactg 86760 
agcaaaaggg gaagcccctt ataaaaccat cagatctcat gagaacttac tatcatgaga 86820 
ataccatggg ggaaaccacc tccatgattt aattacctcc cactgaattc ctcccatgac 86880 
atgtagggat tatggcaact aaaattcaag atgagatttg ggtaggaaca tagccaaact 86940 
atattataca agttatagga gttatagaag gagatgagag agacaaaaaa aatatagatt 87000 
atttaaagaa ataatagcag aatagtttgc aaatctgggg aaagatacaa atatccaggt 87060 
gtacaagaag gtcagaagtt tctaatcaga tttaatgaaa ataagactac accaagacat 87120 
attataatca aactgtcaaa aatcaaaaac agagaaaatc ctgataacag caagagagaa 87180 
gaggcaaatc acatataaga gagttataac aaggctagca gaaaatttct cagcaaaaac 87240 
cttgcagaac aggagagact agaagaacat atgtaaggaa ggaaaaacaa actgccaacc 87300 
aaaaatattt acccaacaaa gctgtccttt aggaatcaag tagagagaaa gatttcccca 87360 
gacaaacaaa agctgaaaga gttcatcacc accaaaccta ttttccaaaa attctacaca 87420 
gaatcttcaa gctaaaggaa aaaaaaaaga gcgagctaat tagtgacatg aaaatatata 87480 
aaagtgattt tgggctgaga caatggggtt ttctagatat acaatcatgt catctgcaaa 87540 
cagggacaat ttgacttcct cttttcctaa ttgaataccc ttcatttcct tctcctgcct 87600 
aatggccctg gccagaactt ccaacactct gttgaatagg agtggtgaga gagggcatcc 87660 
ctgtcttgtg ccagttttca aagggaatgc ttccagtttt tgcccattca gtatgatatt 87720 
ggctgtgggt ttgtcataga tagctcttat tattttgaga tacatcccat caataccgaa 87780 
tttattgaga gtttttagca tgaagggttg ttgaattttg tcaaaggcct tttctgcatc 87840 
tattgagata atcatgtggt ttttgtcttt ggttctgttt atatgctgga ttacatttat 87900 
tgatttgcat atattgaacc agccttgcat gccagggatg aagcccactt gatcatggtg 87960 
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gataagcttt ttgatgtgct gctggattcg gtttgccagt attttattga ggatttttcc 88020 

atcaatgttc atcaaggata ttggtctaaa attctctttt ttttgttgtg tctctgccag 88080 

gctttggtat caggatgatg ctggcctcat aaaatgagtt agggaggatt ccctcttttt 88140 

ctattgattg gaatagtttc agaaggaatg gtaccagttc ctcagaaaaa atgctcacca 88200 

tcactggcca tcagagaaat gcaaatcaaa accacaatga gataccatct cacaccagtt 88260 

agaatggcaa tcattaaaaa gtcaggaaac aacaggtgct ggagaggatg tggagaaata 88320 

ggaacacttt tacactgttg gcagaactgt aaactagatc aaccattgtg gaagtcagtg 88380 

tggcgattcc tcagggatct agaactagaa ataccatttg acccagccat cccattactg 88440 

ggtatttacc caaaggacta taaatcatgc tgctataaag acacatgcac acgtatgttt 88500 

attgtggcac tattcacaat agcaaagact tggaaccaac ccaaatgtcc aacaatgata 88560 

gacgggatta agaaaatgtg gcacatatac accacggaat actatgcagc cataaaaaag 88620 

gatgagttca tgtcctttgt agggacacgg atgaaattgg aaatcatcat tctcagtaaa 88680 

ctattgcaag gacaaaaagc caaacatcgc atgttatcac acttagatgg gaactgaaca 88740 

atgagaacac atggacacag gaaggggaac atcacactct gggcactgtt gtggggtggg 88800 

gggagggggg agggatagca ttaggagata tacctaatgc taattgatga gttaatgggt 88860 

gcagcacacc agcatggcac atgtatacat atgtaactaa cctgcacatt gtgcacacgt 88920 

accctaaaac ttaaagtata ataaaaaaag aaagaaaata tataaaagta tatatcactc 88980 

actgttaaaa gtaagtacat ttacaatact ctaataatgt aatggtggtg tgtaaatcac 89040 

tgatatcatt aggataaagg ctaaaagaca aaactattaa aaataataat agctacaata 89100 

gaggtatacc tcagagatat tgtgggttca attccagaat gcctaaataa agcaaatttc 89160 

acaataaagc taatcacaat atttttgttt cccagtacat ataaaagtta tgtttacatt 89220 

ttactgtagc cttttaaatg tgtgataata tgtctaaaaa gtatacatgt taatttaaaa 89280 

atacttaatg gctaaaataa aaagattaac agtcatctga gccttcatca aatattggcc 89340 

tttttactgt tggaaggtct tgctttgatg ttgatgactg ctgacttatc agggtggtgg 89400 

ttgccaaagg ttgtggtggc tgtggcaatt tcttaaaatc agacaacaat aaagtttgtt 89460 

gcatcaattg acttttcctt acatataaaa aattatctgt agtatgcaat actgtttgat 89520 

agcattttac ccagagtaga acttctttca aaattggagt caattctctc aaatcttgcc 89580 

actgttttat caaatatgtt tatgtactag tttaaattat ttgttatcat ttcagcaatg 89640 

ttcacagcat tatcaccaag agtagattcc atttcaagga atcactttct ttgctcatcc 89700 

ataataagca attcctcatc cattctagtt ttattaagaa agtgaagcaa ttcagtcaca 89760 
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tgtacacact ccacttctaa ttctatttct cttgctattt ccaccacttt cgcagtacct 89820 
ttctccagtg aaatcttgaa cccatcaaag tcattcataa gagttggaat caaccttcta 89880 
cactcctgta aatgttgata cattcacctt ctcccataaa tcatgaatgt tcttaatgac 89940 
atctaagtag tgagtccttc tggaaggttt tcaatttccc ttgtcctgat ccagcagagg 90000 
aattactatc tatggcagcc atggccttac aaaatgtatt tcttaaataa taacactgga 90060 
aagtcaaaat tactccttga tctatgggat gcaaaatgga tgttgtgtta gcagaaatga 90X20 
agacaagttt aatctttttg ttcatcgtca tcaaagctct tgggtaacaa tgtgcactgt 90180 
caatgagcag tgatatttgg aaaggaatct ttttttctga gcagtagata tcaacaggga 90240 
gcttaaaata ttcagtaaac catgctgtga acagatgtgc tgccctttag gctttgttgt 90300 
tttatttata gagcacaggc acagtagatt agtctaattg ttaagggccc taggatcttt 90360 
agaatgatca atgagcattg gcttcaactc aaagtcacca cctgcattag ctgttaaaaa 90420 
gaatatcatg atgtcctttg aagctttgaa gccaggcatt gacttctcct ctctagctat 90480 
agaagcgcta gatggcatct tcttctagta ttttactgtt tgtttccact gaaaaactat 90540 
cgtttactgt agccaccttc atcagttatc taacctcaat attctggata acttaatgca 90600 
acttctgtat tatcacttga tgctttacct tgcatgttta tattacagag acaacttcca 90660 
tccttaaacc tcatgaaaca- aactatgcta gcttcaaact tttctgctgc agctttcaca 90720 
tctatctcag ccttcataga ggtgaagaga gtttaagaac ttgttctgga ttaggctttg 90780 
gcttaatgtt gcaactagtt tgatcatata tccagaccac taaaactttc tccatatcag 90840 
cagtaagctt gtttcatttt cttgtaattt gtgtgttcac tgaagtagtc attttacttt 90900 
tcttcaagaa agtttccctt acattcacaa cttggctaac tggcccaaaa ctaagctttt 90960 
ggtctatctt ggtttctgac atgctttttc tccctaagct aagtcatttc tagcttttga 91020 
tttaaagtga gacatgtgtg actcttccta tcacttgaac acctaaagac aatttcaggg 91080 
ctatttattg gcttaatttc aatattactg tgtctcagag aataggaaca gctgaggaga 91140 
gggagacaga tggaggaaca accagttggt ggagctgtca gtatgcacac aacatttatc 91200 
agttaagttt gtcaccttat atggatgtga tttacagcac cacaaaacaa ttacaatagt 91260 
atcatcaaac atcacttgat cacagatagc cttaatggac gtaataataa caataatgaa 91320 
aaagctcaaa ataaccaaaa tatgacacag acatgaaata aacacatagt gctggaaaaa 91380 
tgacagactt gctcaacaca gggttgccac aaaccttcaa tttgtaaaaa aaatgcacta 91440 
tctgtgaaga acaataaagt aaatcatgac aaaatgaggt atatttgtag tgtgttaagg 91500 
gatatactat ataaaaaatt taaattgtga catcaaaaac taaaatgggg gtggtagagt 91560 
aaaagtgtag agtttttgtg tgtaatcaga gttaagttat cattttaaaa tcgctcgtta 91620 
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taagacattc ctttgtaagt ctcaaggtaa ccacaaagca aaacctgtag tagatacatg 91680 
aaagataaaa agtaaggaat caaagcatac cattagagaa aagcatcaac cacagaagaa 91740 
gacaggaaaa atagaagaaa ggaaaaaaaa cacacaaagc agaaagagtg gaagacagaa 91800 
aaaaaatata tgatagccag aatataatta ttaaaatggc aatttagggc acatataact 91860 
gaaagtaaaa atggaaaaaa gatattccat gcaagtggaa accaaaatag atgaggggtt 91920 
gctacattta catcacatag aatagacttt aagtcaataa cagtaaaaca gggcaaataa 91980 
ggtcattgca taataataaa cggattattt catcaggaag aaataacagt tataaatata 92040 
tatggaccct acactggatc acctaaatat ataaagcaaa tattaataga cctgaagaga 92100 
gaaatagact gcaataaaat cataggtctt caaaatccca cttttaataa acagattatc 92160 
tagacagata atcagtaaag aacgattgga tttaaactac actttagacc aaatggacct 92220 
aatagacaca tatagaacat ttcacctagt ggcagcagaa tatataatct tttgaagcac 92280 
Gcaaggaaca .ttctccagga taatcataga gtaagccaca aaacaagtca caacaaaatt 92340 
aagattgaaa tatataaatt attttttcaa aaacaatgac atgaaactag taatcaataa 92400 
tataaggaat ttcaaaaatg ttacaaacgt gaaaattaaa caagagactc ctcaacaatc 92460 
aataggtcaa agaagaaatt aaaagcaaaa atttaaaata tcttaagata aatgaaaatg 92520 
aaaacacatc ataacaaagc ttatgggatg caacaaaagc aggtctaaga ggggagttta 92580 
taccaataaa tacctacatc aaacaaaaga tttaaaataa ataacctaat attacaactt 92640 
aagtaactag aagaagaaga agagtcaggg tttccctttc ataaaaaggg agaaataata 92700 
aaaattagag aagaaagaaa acaaatggag aatagaaaaa caatagaaaa gatcaatgaa 92760 
gagttagttc tttgaaaaga taaaattgac aaacctttag ctagaataac taggaaaaag 92820 
aagaaaactc aaataaaatt agaaatgaaa gctgagacat tacaactgat accatagaaa 92880 
caaaaagcat cataacaagc gactatgaaa aactaaatgt cagccaggcg tagtggctca 92940 
tgcctgtaat cccagcactt cgggaggctg aggcaggcag atcacctgag gtcaggagtt 93000 
tgacaccagc ctgaccaaca tgcagaaacc ccgtctctac taaagataca aaaattagcc 93060 
aggtgtggtg gcgggcccct gtaatcccaa ctactcagga ggctgaggca agagaattgc 93120 
ttgaacccag gaggcagaag ttgcagtgcg caaaaaaaaa aaaaaaaaaa aaaaatagat 93180 
aacctagagg aaatggataa gttcctagat acacaaaaac taccaagact ttgagtattg 93240 
aaggaatgga aaatctgaac acaccaatac taaggagatt taatcagtaa taaaatgttt 93300 
ccattccaag aaaagaccag gatcagatgg cttcactgca aaattctacc aaacacttag 93360 
agaagaacta atagcaatct ttctcaaagt cttcccccaa aaaagtgaaa aaagtgttta 93420 
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cttccaaact catgttacaa agtcagcatt actctgataa caaagtcaga caagatacta 93480 
tgagaaaatt acagggcaat aaccctgatg aacataggtt taaaaatcct caacaaaata 93540 
ctagcaaaca tatttcaata gcacacttga aatattattc accatgatta agtgtaattt 93600 

attcctggaa tgcaacaata gtttaacata cacaaattaa taaatgtgac atgccacatt 93660 

aacaaaatga aagataaaag gcatatgatc atcttaatag atgcagaaaa acatcgtgac 93720 

aaaattcaat atctttttgt agtaaaaatg ctcaacaaat taggtacaga agagaagtgc 93780 

ttcaacaaat aaagaccaca tatgacaagt ccactgctaa catcacagtc aacagtggta 93840 

aactaaaagc ttttcttcta agattaggaa caagataaga atgctcacta ttgcaacttc 93900 

tattcaacat agtactggaa gttctagcca aagaagtaag gcaagaaaaa aagaaaagaa 93960 

aaaagaaagg cattcacatc agaaaggaca aacttaaatt ttctgtttga agatgacctg 94020 

atcctacata tatataaaat atataattta tatataatat gtattatgtt ataaatatat 94080 

gttatatgtt ctatgttata tataatatat gttctatgtt atatataata tatgttatat 94140 

gttatatgtt atatataata tatgttatat gttatatata atatatgtta tatgttatat 94200 

atatgttata ggttatatgt tatatatgtt ataggttata tgttatgtta tatatatgtt 94260 

atatgttatg ttatatatgt tatatgttat attatatatc atatatgtta tatgttatat 94320 

tatatatcat atatgttata tgttatatta tatatcatat atgttatatg ttatattata 94380 

tatgttatat gttatattat atatgttata tgttatatgt tatattatat atgttatatg 94440 

ttatatatta tattatatat gttatatgtt atatattata ttatatatgt tatatgttat 94500 

atatattata tataatatgt tatataatat aatatatatg ttatatatta tatataatat 94560 

gttatatatt atatatgtta tatattatat tatatatgtt atatgttata tattatatta 94620 

tatatgttat atgttatata ttatattata tatgttatat gttatatatt atatgttata 94680 

agttatatat aatatagata atatataaaa tatataatat atatacacaa aacacaaaac 94740 

accaccaaaa actgttaaaa ctaacaaatg aatttactat agttgcagga tacaaaatca 94800 

acatgcaaag tcagtaatgt ttctatacac taacaatgaa ctatccaaaa aaaaatagaa 94860 

tctatttaca ataactccaa aaagaataaa atacgtagaa atgaagttaa tcaagtaggt 94920 

gatagactta tacaaaaaaa ctataatata ttgatgaaag aaattaaact aggcaaaaat 94980 

tggaaaggca tcctgttttc ataaatatac agacttaata ctttaaagtg tctataaatc 95040 

aaagctatct acagattcaa tgcattccct atcaaaatcc caacagcatg ttttacagaa 95100 

attaaaaaaa atcaaaaatt caaattaaat cacaaaagtc ccagaattgc cagtgttaac 95160 

ttgagaaaga acaaagctca aggcatcaca cttcctgctt acaaaaatat attatgaagc 95220 

tacagtaatc gaaacagtgt tgtactggca tgaaggcaga catatagaca gatgaaacag 95280 
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aataaagaag ccagaaataa atccatgtat gtatggtcca ctgatcttca acagtgatgc 95340 

caaaaataaa cagtggtgcc aaaaataaac aatggggaaa ggacagtctc ctcaacaagg 95400 

tgttggaaaa ctggacatct acatgagaag agtgcaattt aacccttaac ttatacaata 95460 

cagaaaaaaa tcaactcaaa acagattaaa gacttaaatg taagacctta aactataaaa 95520 

ttcctagata aaaacagggg aaatgttcct tgacatttct cttggcagtg atttttttgg 95580 

ataagacacc aaaaactcaa gaaacaaaac cacaaataga caagtgggac aacattaaac 95640 

taaataactc tggtgcaata aagaaaacaa tgaatcaaat gaaaaggtga cctatggaaa 95700 

gggagtaaat atttacaagc catacatatg ataaagggtt aatatccaaa atatattaga 95760 

aacatctaca actcaattta aaaaatagta ccaggcatgg tggctcacgc atatagtcct 95820 

agtactttgg gaggttgagg ctggtgaatg gattgagcgc aggtattcaa gacccgccag 95880 

ggaaacatgg caaaccccta tctctacaaa aacacacaca cacacacata tatatatatg 95940 

tacatatata cacacacaca tatatatgta catatatata tatgtgtgtg tgtatatata 96000 

tacgtatata tgtgtgtgta tatatgtgta tatgtgtgtg tgtgtgtatg tgtatatata 96060 

tatgtatata tatatttata tatatgtata tatatatatg tatatatata cgtatatata 96120 

tatatttata tatatatata tatattagcc aggcatggtg ggtgtgtgtc tgtagtccca 96180 

gctactcagg aggctgaggt agtaggattg cttgaacccg gggtgtcaag gctgcagtga 96240 

gccatgacag caccattgca cttcagcctg ggcaacagag agagaccctg tctcaaaaca 96300 

aacaaacaaa aacataggct gtgtgcagtg gctcatgctt ataataccag caccttggga 96360 

agccaaagca ggaggatttg agaccaggag tttgagacca gcttgggaaa catagcaaga 96420 

ccctgtctct acaaaaagta aaaataaaaa taaataacca ggcatattgg catgagttag 96480 

tagtcctagc tacatgggag gctgaagtga gaggattgct tgagcccagg agttcaagac 96540 

cagcctgggc aacatagtga gacccccatc taggaaaaca aaaagcacag gcaacaaaag 96600 

caaaagtata caaatgggat tgcaggaaac taaaaagctt ctgcacagca aaagaactat 96660 

caacagagta aagatacagt ttacagaatg ggagactatt ggcaaactat gcatctgatg 96720 

aggagttaat atccagaata tatgagaaac tcaaacaact caatagcaaa aaacaaaaaa 96780 

taaacccaat ccaaaaaatg ggcaaatgac ctgaatagac atttctcaaa aaatatatat 96840 

acaaaagaag acatacaaat gggcaacatg tatacgaaaa tatgcttaac atccctagtc 96900 

atcagggaaa tacaaatcca aaccacagta agataccacc tcagtctagt tagaatggct 96960 

actacagaaa gacaaatgat aacaagtgtt ggcaaggatg tggagaaagg aaactcttac 97020 

atattgttgg tgggaatgta aattagaaca gtcactataa aaacagtatg gaagttcctt 97080 
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gaaaaattaa aaattgaact gccatatgat ocagcaatco cactaatggg tataaaocca 97140 

aaagtaatga aatcagtatg tcaaagagao atctgctgtc ctgtgtttat tacagcacta 97200 

ttcataatag ccaagatatg gcaacaatct atgtccatca atggatgaat gaatttaaaa 97260 

atcttggtat atacataata tgcaatacta ttcattcata aaaaagaaag aaattgtttc 97320 

atctgtgaca acatagatga aoctggagaa cattgtgtca agtgaaataa gctacacaca 97380 

gaaagaaaao tactgcatga tttcactgat atgtggaatc ttaaaaagtt gatctcatag 97440 

aagctgagag tagaatggtg gttagcagaa gctggtgagg gtaagtgaaa gagggtatgg 97500 

gagaacactg gtcaatgagt acaatgttaa agttaaatag aggaacgggt tctagtgttc 97560 

tattgcacat cagggtgaat ataaaaaaca atagtgtatt atatatttca aaataacttg 97620 

aagaaaggat tttaaatgtt otcaccataa agaaatgata aatgtttaag atgatggata 97680 

ttctaattac cctgatttga gcattacaca atatatacac atatcaaaac atcatatcgt 97740 

aooacataaa tatgtacgat tattctggct caattgaaaa taaaataaaa ttttagaaac 97800 

gttaagaaag gaatattatg aacaatctta tctcacagat tttataacct agttaagatg . 97860 

gaccaattcc ttgaaagata cagtctatta taactcatac agggagaaat aggcaatcta 97920 

« 

aatactgtta attgtattaa aggaattaat tattgaataa acttccaaaa cagaaagcat 97980 

caagcccaga tgggttcact ggagtattcc accaaacatt taggaaagaa aactacacca 98040 

agtctctaca gttgcattga aaatatagaa gcagaggcaa tattttctaa ctcattgtat 98100 

aaggtcagca ttacactaat accaaaagca gacaaaaata ttgcaaggaa aaaaacacat 98160 

caatatatct gatgaatata gatgcaatat ttctcaagaa atattagcaa gtcaatccaa 98220 

caaatttata aaaataattc tacacttgag ataagcctgg gcaacatagc aagacactgt 98280 

acctacaata aatacaaaaa ttagccaagc atggtggtgc atacctgtac tcctagctac 98340 

ttgggaggtt gaggcaggag gattatttga gcccaggggt tagaggctac agtgagctat 98400 

gattgtgcca ctgcactcca gcctgggtga cagagtgaga ccctatctct taaaa'aaaga 98460 

aaaaagaatt atacaccaca aacaagtaag gcttatccca aaaaagtaag actggttcaa 98520 

cactggatat cagttaatgt aacccattat atcaataggc taaagaagaa aagtcacaag 98580 

attataacaa cagattttta aaaagtgttt aacaaattac aacacctatt atgatttcaa 98640 

aaaaaattca gtaaactagg aatagaaagt aatcttcctc aacttgaaaa ataacatcta 98700 

caaaacagcc ttcatctgac atcttattta atagtgagaa actatgagct ttcttgctaa 98760 

gattaggaag aaaacaagga tttccctctc acattccttt tcatcattgt acagtaagtc 98820 

ctagctagtg caatgagaca aaaaaggaaa taaaagacac actaattgga ggagagaaat 98880 

aaaatgttgc aggtgacatg attgtctatg cagacaatct aaaataatta accaaaaaaa 98940 
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ctctcccaga actaataggc aattaaagca agattacaga atatgttaat atacaaaagt 99000 
tcattgcttt cctatataac cgcaataaac aagtataatt tgatattaaa aaacaatatt 99060 
atctgcatta gcacacctga aaatgaaata attaggtata aatttaataa aatatgtgca 99120 
agate tatac aaggaaaact acgaaacttg aatgaaagta atcaaagaaa aactaaataa 99180 
agagttattc catgtataca taggaagatt caatattgtc aaggtgtcag atcttctcaa 99240 
cttactctat gtagattcaa cacagtccca gtcagtaacc cagcaagtta ttttgtggat 99300 
atcaacaaac tgattctaaa ctttatgtga aagacaaaac atgcagaatg gtcagcataa 99360 
tattgaaggt gaagaacaaa tttgaaggac tgatacttcc caaaatcaat atttactata 99420 
aagctatagt aattaagaca gtgtgctatt agtgaaagaa caaacaggtc aatggaacag 99480 
aatagcaagc ccagaaatag tctggcaaaa atatagtcaa ccaatattta caaaaaagca 99540 
aaatacaatg gagaaaaggt cataatttta acaaatggtg ctgaaacaag aagacatcca 99600 
cacgcctaat aataaatcta aacacagatc ttatatccat gacaaaaacg aacataaaag 99660 
aaaacctaga tgaccttgag tatggtgatg actttttaga tacaacacca aaggcatgat 99720 
ccatgaaaaa aagaactgat aagctggcat ttattaaaat taaaattaaa atttctgctc 99780 
tgcaaaaggc aaatgagaag caaaccacag actaaagaga atatatttgc aaaaggcata 99840 
cctgataaaa gattgttatt caaaatatgc agaagagctc ttaaagcaca ataggtaaat 99900 
gaacaatctg attttaaaat ggtcaaaaaa acagacacct caccaaaaaa ttacacagaa 99960 
ggcaaataag taaatgaaaa gatgctcata tacttatgaa tatctttgtt attagggaat 100020 
tgcaaattaa aacaatgata tagcactgca tatttattag aataatcaaa atctaaaaca 100080 
ttaacaacac aaaatgctat tgagaatgtg aagcaacagg aacgttcact cattgctagt 100140 
gggaatacaa aatagcacag ccacttcagc gacagtttag tggtttattc caaaattaca 100200 
ttcttaccat gcattccagc aattgtgctc ctgggtattt acccaaataa gttgaaaaca 100260 
tatattccca caaacacctg catacaaacg tttatatcat gtttattcct attttccaaa 100320 
cttggaagca accaagatgt cctttggtag gtaaatggag aaataagctg tggtacatcc 100380 
atacaatgaa atattattca gtgctaaaaa gaaatgagct gtcaagctta tttaaaggtt 100440 
acgtgtctca tgcaaagaca tggaagaaat ttatgcatat tagtaagcaa aagtgacagt 100500 
ctgagaagct acatactgta gattccaact ttatgacatt ctggaaaagg caaaactctg 100560 
gagaaagtga aaacattagt ggttgccagg gcttggggag agggaggcag agcacagagg 100620 
gggaacaaag ggcagtagaa ctactctgta ctatactata acagtggaca catgtctata 100680 
cttttgtcaa aatacaatgt ataacaccaa gagtgaatcc taatgtaaac tatgaacttt 100740 
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gggtgataat gatgtgtcaa tgtaggttca tccactgtaa caaatatacc actcttgcat 100800 
ggaagcgtga tggtgagtga ggctctgtgt atgggggaca gggggtttat gggaaatctg 100860 
taacttctgc tcaattatgt tattaatcta aaactgctct aataataaaa tctactttta 100920 
aacatgtaat agaattccca agaactgtgg gacaattaca caaagtgtag cacacacata 100980 
taacagaaat accaaaaaaa gaaaagagca gaagaagtat ttgaagaaat aaaagatgaa 101040 
aaattttcta aattaatggc aaacaccaat ccagagatcc aggaagctca gagaacacca 101100 
aagcaaaata aatgtcagaa aaaatgtacc tctaggcata tcatattcaa actgcagaaa 101160 
accaaagaca aagataaaat cctgatagaa gttgaggggg aaacttacag ataaataaac 101220 
aataataaga atttcctctg acttatcaga aatcagccaa gtaagaagat agtataataa 101280 
aatatttaaa gtgggaaaag aatacaaatc cccctgacct agaattctct atcaagtgaa 101340 
attattcttc aaaagtgaag aagaaataaa gactttctca gacatacaaa aacaggaaat 101400 
ttctcaaaag taaacttttc ttacaaaaaa tgctaaaaga agttcttcag ataaagggga 101460 
aaatgataaa ataataagca caaatcaata taaaggaaga gcaattgaga aggaataaat 101520 
gaaggcaaat taaagctact tttcttactc ttaattgacc tatagatagt tgctcgacac 101580 
aaaaatagca ataatgcatt ggtaatcatt attaccactt attaggataa gtgaactgaa 101640 
tgacaatatt aaaagagatg cagcccaggc acgggtggcc catgcctgta atcccagaac 101700 
tttgggaggc cgaggcaggt ggatcacttg aggtcaggag cttgtggcca gcctgaccaa 101760 
catggtgaaa ccccgtgtct actaaaaata caaaaatagg ccagatgtgg tggcacgcac 101820 
ctgtaatccc agctactcgg gaggctgaag caggagaatc tcttgaaccc ggaaggcgaa 101880 
ggttgcagtg atccaagatt gtaccactac actccagcct gggtgacaga gtgagactct 101940 
gtctaagaaa aaggggcggg gggttgaatg caaagagaaa tacaatttaa ctgaacagct 102000 
atcagttaaa gtgaactgaa ctattaatta aatggatcta attggcattt gtagaatact 102060 
tcatccagta atagcagaat atatcttctt atcaaactta tatgcgattt ccatcaagag 102120 
aaacttcata tggaccataa aacacatttt aacaatgtaa aaaaatagaa attacacaaa 102180 
gtatactctc agaacacaca gaatagaact aaaaatcagt aacagaagga tagctgaaaa 102240 
ttttcaaact gtttggagat tgaacaacac acttctaagt aataagtgaa ttgaagaaga 102300 
tatctcaaga aatattttga aataaatgaa aatgtaacat catagtttgt aggatgcagt 102360 
aaaagcagtt ctttgagtga aatttataac actgaatgga tatattagaa aataaaaaaa 102420 
tgtgatatta ataaactaat cttaggcctt aggaaactag agaaagaaaa gaaatgtagg 102480 
cctaaagaaa gcagaataaa agaaatgata aaaattggag cataaattat taaattgaag 102540 
tagaatttaa attttctctg cagagtaagt atgtgaggta atgaatatgt taattagctt 102600 
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tacttagcca ttccacatgt atacataaac acgttataca gcataaacgt atattatttt 102660 
gtttttcaat taaaaattta attaaaatca gaaaaaatta ataattaaat ttaaaacagg 102720 
aattcaatac tgaaaaatca atgaaaccaa aagctgtttc tttaaaaaga taataaaatt 102780 
gataaatttc tagccaggct aatcaagaaa agaagaggat tcaacattct taaagaaaac 102840 
aattttcaac ccagaatttc atatccagcc aaactaagca tcataagtga aggagaaata 102900 
aaatatttta cagacaagca aatgctgaga gacttttgtc accaccaggc ctgccctaaa 102960 
agagctcctg aaggaagcac taaacatgca aaggaacaac cggtaccagc cactgcaaaa 103020 
ccatgacaaa atgtaaagac catcgagagt aggaagaaac tgcatcaact aacgagcaaa 103080 
ataaccagct aacatcataa tgacaggatc aaatttacac ataacaatat taaccttaaa 103140 
tgtaaatggg ctaaatgctc caattaaaag acacacactg gcaaattgga taaagagtca 103200 
ggacccatca gtgtgctgta ttcaggagac tcatctcatg tgcagagata cacataggct 103260 
caaaataaag ggatggagga agatctacca agaaaatgga aagcaaaaaa aagcaggggt 103320 
tgcaatccta gtgtctgata aaatagactt taaaccaaca aagatcaaaa gagtcaaaga 103380 
aggccactgc ataatggtaa agggatcaat tcaacaagaa gagccaacta tcctaaatat 103440 
atatgcaccc aacacaggag cacccagatt cataaagcaa gtccttagag acctacagag 103500 
agacttagac tccaacacaa taataatggg agactttaac accccactgt caatattaga 103560 
cagatcaacg agacagaagg ttaacaagga tattcaggac ttgaactcag ctcagcacca 103620 
agcagactta atagacatct acagaactct tcaccccaaa acaacagaat atacattctt 103680 
ttcagcaccg cattgcactt attccaaaat tgactacata attggaagta aagcactcct 103740 
cagcaagtgt aaaagaacag caatcacaac aaactgtctc tcagaccaca gtgctatcaa 103800 
attagacctc aggattaaga aactcactca aaaccgcaca actacatgga aactgaataa 103860 
cctgctcctg aatgactact gggtacatag tgaaatgaag gcagaaataa agatgttctt 103920 
tgaaaccaat aagaacaaag acactatgta ccagattctc tgggacacat ttaaagtagt 103980 
gtgtagaggg aaatttatag cactaaatgc ccacaagaga aagcaggaga gatctaaaat 104040 
tgacacccta acatcacaat taaaagaact agagaagcaa gagcaaacaa attcaaaagc 104100 
tagcagaagg caagaaataa ctaagatcag agcagaactg aaggagatag agacacaaaa 104160 
acccttcgaa aagtcaatga atccaggagc tggttttctg aaaagatcaa caaaattgat 104220 
agactgctag caagactaat aaagaagaaa agagggaaga atcaaataga tgcaataaaa 104280 
aataataatg tggatatcac taccaatccc gcagaaatac aaactaccat cagagaatac 104340 
tataaacacc tccatgcaaa taaactagaa aatctagaag aaatggataa attcctggac 104400 
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acatacaccc tcccaagact aaaccaggaa gaagttgaat ctctgaatag accattaaca 104460 
ggctccaaaa ttgaggcaat aattaatagg ctaccaacca aaaaaagtcc aggaccagac 104520 
caattcacaa ctgaattcta ccaaaggtac aaagaggagt tggtaccatt ccttctgaaa 104580 
ctattccaat caatagaaaa agagggaatc ctccctaact cattttataa ggccagcatc 104640 
atcctgatac caaagcctgg cagagacaaa acaaaaaaaa gagaatttta gaccaatatc 104700 
cttgatgaac atcgatgtga aaatcctcaa taaagtactg gcaaactgaa tcctgcagca 104760 
catcaaaaag cttatccacc atgatcaagt cggcttcatc cccgggatgg aaggctggtt 104820 
caacatatgc aaatcaataa atgtaatcca gtatataaac agaaccaaag acaaaaacca 104880 
catgattatc tcaatagatg cagaaaaggc ctttgacaaa attcaacaac ccttcatgct 104940 
aaaaacactc aataaattag gtattgatgg gacatatctc aaaataataa gagctatcta 105000 
tgacaaaccc acagccaata tcatattgaa tgggcaaaaa ctggaagcat tccctttgaa 105060 
aactggcaca agacagggat gccctctctc accactccta ttcaacatag tgttggaagt 105120 
tctggccagg gcaattaggc aggagaagga aataaagagt attcaattag gaaaagagga 105180 
agtcaaattg tccctgtttg cagatgacat gattgtatat ctagccccat catctcagcc 105240 
ccaaatctcc ttaagctgat aggcaacttc agcaaagtct caggatacaa aatcaatgta 105300 
caaaaatcac aagcattctt atacaccaat aacagacaat cagagagcca aatcatgagg 105360 
gaacacccat tcacaattgc ttcaaagaga ataaaatacc taggaatcca acttacaagt 105420 
gacgtgaagg acctcttcaa ggagaactac aaaccactgc tcaatgaaat aaaagaggat 105480 
acaaagaaat ggaagaacat tccatgctca tgggtaggaa gaatcaatat cgtgaaaatg 105540 
cccatactgc ccaaggtatt ttatagattc aatgccatcc ccatcaagct acaaatgact 105600 
ttcttcacag aattggaaaa aactacttta aacttcatat ggaaccaaaa aagagctcac 105660 
atcgcgaagt caatcctaag ccaaaagaac aaagctggag gcatcacact acctgacttc 105720 
aaactatact acaaggctac aataaccaaa acagcatggt actggtacca aaacagagat 105780 
atagatcaat ggaacagaac agagccctca gaaataacgc cgcatatcta caactatctg 105840 
atctttgaca aacctgagaa aaacaagcaa tggggaaagg attccctatt taataaacgg 105900 
tgctgggaaa actggatagc catatgtaga aagctgaaac tggatccctt ccttacacct 105960 
catacaaaaa tcaattcaag atggattaaa gactaaattg ttagacctaa aaccataaaa 106020 
accctagaag aaaacctaag cattaccatt caggacatag gcatgggcaa ggacttcatg 106080 
tctaaaacac caaaagcaat ggcaacaaaa gccataattg acaaatggga tctaattaaa 106140 
ctaaagagct tctgcacagc aaaggaaact accatcagag tgaacaggca acctacaaaa 106200 
tgggagacaa tttttgcaac ctactcatct gacaaagggc taatatccag aatctacaat 106260 
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gaactcaaac aaatttacaa gaaaaaaaca aacaacccca tcaaaaagtg ggcgaaggac 106320 
aagaacagac acttctcaaa agaagtcatt tatgcaggca aaaaacacat gaaaaaatgc 106380 
tcaccaccac tggccatcag agaaatgcaa atcaaaacca caatgagcta ccatctcaca 106440 
ccagttagaa tggcaatcat taaaaagtca ggaaacaaca ggtgctggag aggatgtgga 106500 
gaaataggaa cacttttaca ctgttggtgg gactgtaaac tagttcaacc cttgtggaag 106560 
tcagtgtggc gattcctcag ggatctagaa ctagaaatac catttgaccc agccatccca 106620 
ttactgggta tttacccaaa ggactataaa tcatgctgct ataaagacac atgcacacgt 106680 
atgtttattg cggcactatt cacaatagca aagacttgga accaacccaa atgtccaaca 106740 
gtgatagact ggattaagaa aatgtggcac atatacacca tggaatacta tgcaaccata 106800 
aaaaaggatg agttcatgtc ctttgtaggg acatggatga aattggaaat catcattctc 106860 
agtaaactat cgcaaggaca aaaaatcaaa caccgcaggt tcgcactcat aggtgggaat 106920 
tggacaatga gaacacatgg acacaggaag gggaacatca cactctgggg actgttgtgg 106980 
ggtgggagga ggggggaggg atagcattag gagatatacc taatgctaaa tgacaagtta 107040 
gtgggtgcac cacaccagca tggcacacgt atacatatgt aactaacctg cacattgtgt 107100 
acatgtaccc taaaacttaa agtataatag taataaaata aaatttaaaa aaagatagca 107160 
tgagtagaaa aaaaaaaaag aagagggaag ccacaagtta ctaatatcag aaatgaaaga 107220 
ggggtcatca ctactgattc tgtggacatt aaaagaatga taaataaata ccataaacaa 107280 
ctctatgccc acaagtttga tagcttagat gaaatgaacc tattccttga aaagctcaat 107340 
ctgccaaaac tcacataaag tgaaatagat aatttgaata ggcctatacc tactaaagaa 107400 
actgaattaa taattaataa tcttccacaa aagaaaacac caggcccaga tgggttcatt 107460 
agtgaattct gtcaatcatt taaaaaaaaa atggtaacaa ttatctgcag tcttctctag 107520 
aaaacagatg aaaagggaac cattcctaat tcatatactt gctgacttgt tccatgaggc 107580 
cagaaatgcc ctaattctaa aaccaatgat attaaaagaa aggfaaaacta cagaacaata 107640 
tccctcatga acataaatgc aaaaattctc aataaatatt agcaacttaa atttttgttt 107700 
aaaggtataa tcatgtatag aaagaattat agtccgtggc caagtgagat ttattccagg 107760 
catgaaaatc tggttaaata tttaaaaatc aattaatata atctatcaca caaacaagat 107820 
aaagaaggaa aattatgtga tttttccatc aatggacaca ggaaaagcat ttaacaaaat 107880 
tcaacatcca tttataataa aaactcagca aactggaaac aaagagaaac tgcctcaact 107940 
tcattaacag catctaaaaa aaccatacag ttaacctcat atttaatggc aagaaattag 108000 
gtgcttttgt cctaagactg ggaacaaggt aaagatgtcc tctctcgcca ctcccattca 108060 
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acactgaact agatgtctta gataatttaa caggacaaga aaagtaaata aaaagtttat 108120 
acattggaaa agatgaaata aaaatgtctt tgttaataaa tttgtaattt tctatattaa 108180 
aatctgaaag aatagacaag aaaaataaaa acaaacaaaa aagaaacttc cagacctagt 108240 
aggcaaaaaa atggcaacac tgaagaatag aaggttaata tacaaaagtc agttgttttt 108300 
ttccagcaat taccaagttg agattgaaat taaaaacaca ataccattta tgtgaacacc 108360 
aaaaaatgaa acttaggtgt aaatctaaca aaatatatac aagatctatt gaagacgact 108420 
acaaaactga tggaaaaaaa agcaaatatc taaataaatg aagagatgtt ccatgttcat 108480 
gaataggaag acttaatatt gttaagatgt ctattctgcc caacgtgatc tataaattca 108540 
atacaattcc aatcaaagtc ccaacgagat attttatcaa tataaacaaa ctctaaaatc 108600 
atatggaaag gcaagagaca cagaataacc aatgcaatat taaagaaaaa gagcaagttt 108660 
gaacaaccaa cactacctga ttttaagact tactataaag cttcataatc aagacagcat 108720 
ggtattggta aaagaatcaa caagtagatt gatgaaacag aacagagagc ccagaaatag 108780 
acatacaaaa agatagtcaa ctagtctttg acaaatgagc aaaggcattg agagatagcc 108840 
ttttcaagaa tggctgtctt agttgaaaga gtggttgaat atccacatgc aaaaaaaaaa 108900 
aaaaaaaaaa aacttaatgt agaccttatg cctttcacaa aaaccaactc agtttagatc 108960 
atagatataa cataatgctc aatactatta tataaaacga ctagaagata acataggaga 109020 
agatttaagg gacctactct ttgtctatga gtttttagat acaacaccag aagcataatc 109080 
catgaaaaat gattgaaatg ttggacatta tgaaaattta caactttgca aaggacactg 109140 
tttaagagaa tgaaaagaca gccctcagaa tgggagaaaa tatctgcaga atatatattt 109200 
cataaaggat ttgtatccaa atatgtacaa agaactctta aaactcaaca ataagaaaac 109260 
aactcatttc aaaaacggcc aaaagatctg aataaaccct tcaccaagaa gatggcaagt 109320 
aaacagatag aaagaagctc aaaatcatat gccattaggg aattgcaaat taaaacaaca 109380 
gtgagacaac actacacatc tattagataa tcactaaaat tcaaaaaaaa ctgaaaatac 109440 
caatggctga tgaggatgaa tattaacagg aactttcatt cattgctgct gggaattcaa 109500 
aatagtacag tcactttgga agataattta gcagtttctt acaaaactaa gcatggtctt 109560 
accatacaac ccagcaattg taaggctatg tatttactca gttgagttga caacttatat 109620 
ccacagaaga atgtttattg tagctttatt cataatcacc aaaaactgga agcaaacaag 109680 
aggttcctca aaagctgaat ggataagctg cggtacatcc attaaatggg atattattca 109740 
atgatataac aaaatgggct atccagccat gaaaagacat ggaggaatct taaatgccca 109800 
ttgctaagtg aaagaagcca gtctgaaaaa gctacatact aaatgattca actatattac 109860 
attctcaaaa aggcaaaact gtagaagagc aagaaaccag tgttgccaga gatttggagt 109920 
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aggaaagaga tgaatagata aaagcagtgg atttttagga cagtgaaact cttctgatat 109980 
tataatggtg ggtacataac attatgcaat ggtcaaaacc catagaactg tccaacacag 110040 
agtgaacctt aatataaaca atggacttta gttaatgata atatatccat attggttcat 110100 
cagttttaac aaatgaacta caagatgtta atgacaagga aaagtatata gatgggaggg 110160 
agtcataggg aaaccctctg tgttgtttgc aattttttgt aaatataaaa ttgttctgaa 110220 
aaataaagtc tattttctta aagtcagtcc agggatcctc ctttatatga atacaatctg 110280 
gcattttgct aagtaaaatt tctctgtttc cagcatttaa gtctcacttc tctgtgcaag 110340 
attttcttta gttttatatt tcctaggtca tttctgtcag aattaagggg tggggtagtg 110400 
agaattcaga catatgtcct caagctgcca tctgttctcc aaatggtatc ccatatgttg 110460 
ttgatgtgta gagttttcat tgttgacatt tatgaattga ctataattgt actttgattt 110520 
ccttcttgac tcgagttaat tatgagggtg tttaaatttc caagtggtat tttttaaaaa 110580 
aacatgttac taatttctgc ttcattgctc attgttttaa atatgtgatt tgcgtaagtc 110640 
ttacttttta gagctattga gagggttttt tttcttggta gtttagtata tcgccaattt 110700 
ttgaaatgtc tagatatttt atagcaatgt gtactattct tatgttatga agtctatttt 110760 
tttctatctg tatatttttg tttattcttc tgtaaaactt tgaacggttt cattatatat 110820 
ttagcattat gttacttgtt acatataact ttgtgaatat cctattgcca tcatgtgttg 110880 
tatcttttat caatataaat aatcctttat cacatgaaat gctgtttgct ttgaatttaa 110940 
tttgtctaat agcaatgttg taatgaatgc cttctttaat ttttactttc ttgatatacc 111000 
tttggccatc cctctatttc cacacttaga tatctaagtg atgtggttta aaatttttct 111060 
cttttaaaca gcatagaatt agattttagg gaagttttaa aataagttat tttcttaatg 111120 
aaagacctta agtcattcac atgtattgcg acaaatagtg ttttggtttg gtgtaatttc 111180 
agttccttat tttagtcttt ctatattaag ggttttttgt ttgtttgttg ctatttactt 111240 
gttttttctt tttatttgat tatcaattgt ggtaggcttg aaaaaagctc cccgccccta 111300 
ccccccacca ccaccaccac cacaaaatcc gtatcctaat ccctggaacc tgtgatgtta 111360 
ccttatataa gaaaagggaa cttcacaaat gtaatcaaat taaagatctt aaaactggga 111420 
gattattctg gtttatccca gtgattccca catgtaatca taacagggtg gtagaaggga 111480 
gtgtgacaca cagacagaag aggaaaaggc agagcttggg ccgatatggt cacacgccaa 111540 
ggactgctgg cagtcacccg aagccagaag aggcaagggg cagattctcc cctagagtct 111600 
ctggaggact catgggcttg ccaaactttt atttttgccc agtagaacta attttggact 111660 
tctggtctcc agagctgtgc gagaataaat ttatgttgtt ttaagccata aagtttgtag 111720 
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tagtttgtta cagccaccat aggaaactaa ttaaaatttc tccattttta ttttactcta 111780 
tcatagctta tctttaactt ttagctttaa tatatctcta tcaaggttaa aaaacaaaag 111840 
agagacccaa aatacataac aaaattgaac atgatgtctt tctttgtact ctgtcccttc 111900 
actcaactcc cttaagtttt tcctgaaatg ttccatagtc atatttataa ttattttgta 111960 
agcatttcat gtgtttctaa cttcattgct taccatagtt gttattattc tatgctcttc 112020 
cctcttcttg aattcttaat tttaattcaa ttctcaatta gctgaagtat gccaggaaga 112080 
ctataaaata ttatattttc ttccttctta tatatctgag aaaactgttc tgtgccttgg 112140 
cacagaaatc atatttacaa agttcttgaa tcacaatttt ttttcaaaac tctgaacatg 112200 
ttgttatagt gttttctggc attacttttt gtgggaaaga ggtatgataa tttaagatgc 112260 
atttctttta ggtaactttt aaattttttt accctgcctt aaatgcctga agatggttaa 112320 
tctctgaaat taaatactct tttgattgta tctcaaagtc agcatctttt tatcaacatt 112380 
gactgctctt ctggacactc ttatttatgg ctcttcctcc tgctccagat tctgaactgt 112440 
gcctgtcacc tccctcaaga cagccacaga gccccaagat ccttataacc tcaccatgat 112500 
gttccactcc acagggttct gccattgtgt gctttctttt tcatccttct ctgatgccac 112560 
ctcctcctct gtttgttgcc agccttaata attttaaatt catggatcac aggtagaaac 112620 
cccagttttg tctaagtaga cgatcgattt tttttttctt aaagctagtc ttccattttt 112680 
ctctttctct atttgtcacc ttttcccaag agataatttt ttcttttagt tttcagagga 112740 
gccttcaatt tcttttttta aacctttttt ctaactttta ttttaagttg aggggtacat 112800 
gtgcaggttt cttatacagg taaacttgtg tcatgggggt ttgttgtaca gattatttaa 112860 
tcacccaggt attttttttt tttttttttt ttgagacgga gtctcgctct gtcgcccagg 112920 
ctggagtgca gtggcgcaat ctcggctcac tgcaagctct gcctcccggg ttcacgccat 112980 
tctcctgcct cagcctcccg agtagctggg actacaggcg cccgccacta cgcccggcta 113040 
attttttgta tttttagtag agacggggtt tcaccgtttt agccgggatg gtctggatct 113100 
cctgacctcg tgatccaccc gcctcggcct cccaaagtgc tgggattaag cttcatattc 113160 

i 

attagttatt tttcctgatc ctaaccctcc tcccacccta caccctccta taggccccag 113220 
tgtgttttgt tcccctctat gtatctgtgt gttctcatca gttagcttcc acttataagt 113280 
gagaatatac agtattttgt tttctgttct tgcattagtt tgctgaagat aatgccctcc 113340 
atccatgtcc atccatgtcc ctgcaaagca cattatctca ttctttctta cgagccttca 113400 
atttctaatc attcctcaga acctcctccc tatggtggct ttcacaacat tagtgctgag 113460 
atgcatttgt agaaataaca aatcataaag aaccacaatg tgcttgattg ttatttgtct 113520 
gagccagaga aagaacaaaa aggcacgaat tcagaacaaa gataactgcc aagaagaaag 113580 
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tgaagtaaat gctctccatt agacttcaat ctacaagtta ctcttctact ccctctggat 113640 
ttgaacctaa tgttacaaag tggagataca gccagactct tgatagaaat gaggctccaa 113700 
gttacctgaa aaatgtccac cttgatcatc ccttcaaata tatttcattt aatcacattc 113760 
taaatatgct cttcacataa gagaaattaa cttcttaaat cccaatttcc ctttctgcta 113820 
attatgcaaa aaaagattgt acacagataa ttaaaaggtg tgggtaaagt taataaaaga 113880 
gacagtggga aggaagagga gacccagatc ctgagcctaa tgaagtggat gaaaaagggg 113940 
agctctctga tggtgtcagt gctgatgatg ataaatgatt caacaccgca cataggctcc 114000 
ctaaaaccgg aacccttcta tcatctcttg actcccactg ataaaaagca aatgaacttt 114060 
tgtttctaat tattctggaa tgtcaattgt ggcagaaaaa attagtttag ctttaaaagt 114120 
caattccaaa caaatgtacc atgcaaatgt caaggactga tcctgaaata tggagtttcc 114180 
aagaggccaa gggaataagt gagcaagaaa gaactgaagc attgtcactc atcatggcca 114240 
gcagtacaaa ttctagcacc aggaaccaaa ctctgcttcc tggtgtggag atgaaaaggg 114300 
agagttccct ggtccatcca tcacaggaca tgggacaggg gtatggctct ctgttcagct 114360 
tcctcaagct caaactcctt ataggagggg gagcatgcag aagggtaggt gcaggagcca 114420 
gggcaagtgc attgaactcc ggccccacag tagtgtctgg ggtgggtgcc tgtgactcct 114480 
gaagccccag tgttatagtg ctgttttaac tctgccatct gcagatggct taagtggtaa 114540 
ccagctcagt gccctcttgg tggccatgtc ctcgtccata ggccaggaag agtcaggtca 114600 
cacacagact tgaaggatga atgcaggagt tttattgagt ggtggaggtg gttctcagtg 114660 
ggatggatgg ggaactggaa gggggatgga gtgggaagat gatcttcccc tggagtttgg 114720 
ctgtccaggg ccattctcct gtctgattgt ccccagctga actcttctta gtgttcagac 114780 
cctccttctc ttctctcctt ctctgccatg ctgttctgcc attcgtctgc tcatctcctc 114840 
ctggagcctg ggatttgggg tttatatggg tacaggatag gggcatgtag ctggccaaaa 114900 
ggcaactttt tgggcacaaa aacaggaata cctgttctca tttaagacca ctggtttcca 114960 
ggcttgaggg tggggtcttt gctgggaaac tgccctcttt cacccagtat ttccctttct 115020 
cttgtccatg taagagagtt atttcccttc ctaatccaac actcacccct tctttggttc 115080 
acctctcaga tttctaaata acacaaggtg ctattctttc cattatgtgc cttccctcac 115140 
ctctctgaag tcataagctt cctctcattt ctgtctttag gttgtgctgc tctacgcctc 115200 
tacctacgtc ctggtgtccc tcagcataga cagataccat gccatcgtct accccatgaa 115260 
gttccttcaa ggaggtgagc tggctttacc aggtgctctt tcataaagaa ctgtccttga 115320 
gtgagggtag gatcattttc actaatattt tactcttaat aatagtgatc ttcctcctct 115380 
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ttccttctct ttcttctctc ttcattctcc tcttccacct ctttcctcac ttcatgggtc 115440 
tgattgtcta aaaagatgta caaaagaaaa cttagatgcc tggaaaaatg ccgatggttc 115500 
ccatggacag cccagagcca ctaaggaagg tgggtggctg acccactttg tgaccattca 115560 
gcttgatgct gatgcttcca tctttttoct ctagagcagt gttgacatgc atacctgtcc 115620 
aaagaaatga actctcccao actaaottct cattttccag aaacaaacac cttgttctcc 115680 
ttatccaaaa ccatagaaat aattggaaag atgtttttaa ttgatcatcg tattgggaaa 115740 
aaaaaataag gaaactctct ttgatccaga aatggttagg gatccctaag aaaaggaagc 115800 
agatgggtca gagggaaggc tgagaccaca cagcocacaa aacccctagg agatgctgaa 115860 
gtgggatggg aggccttagg agcacttcct gaacaggctg catcaatgtg gagtgtagga 115920 
gaggacccga ggcaactgac aaggtgattt ctgaagtaca acacagagga ttaatcacag 115980 
caagtttccc tggatttgaa gaggcagaga ggggccotgg ggagcagtaa tgccctagag 116040 
gaagggtgag tacatattoa caagactagc tgcctactgc tctgcccaca atcagagacc 116100 
actctcctcc atacaccctg aaagcaaata caaagccatt gtttagcaac ctgttctccc 116160 
ttcctccaga taacttcgaa aactgataaa gaacctaaca aaaaatcttg accacaaaaa 116220 
tgagtgcaat caaatattat ttaaaaagtg acaaaatcca aactgtagaa aacagataat 116280 
acctttaaaa totcattcca oacaatttgg atctagggaa gcaaatttaa tagaagtagc 116340 
agaattttag gataaatata attaacgtca tcaaggatct acaagaagtg gagtgaaata 116400 
gggtaatttc atttactatt gaoaggggta taagtaaaca catottattg aaatataaaa 116460 
tgtgcagccc tttgatatga cttcttggta tctgctctgg agaaacaccc acacgtgtgc 116520 
atgtatacat atgtgatgta tattgtggca ctgcctacaa tagcaggaat tcatcgctaa 116580 
ccttggtgtg tctcagtgga ggaatggcta actagtctat ggaagaagtt tcagtacctt 116640 
ctgtttattg attgcatact aggtgtcatt oatgattcta aotgattcat acatattaaa 116700 
tattttaata ttcataatat ctctgtgagg tgggctctat tattatctgc attttacagg 116760 
taaggcaatc gttgttccaa gagtgaaagt aatttgcooa aggtoacagc tagtaagtgc 116820 
aggaaoagga tttgaaccca cacagtctgg gtctagctcc agcgccctca aatgctatat 116880 
agcctgcctc tcaatgtgga agtggtagag ccatgggcag taatatgcat caacatgaac 116940 
atatccatat tgtatgctga gtgacaaaaa caatatgcac aaaaatccat gtgaacgata 117000 
acattaaagt caaaaactca aaacaatgct agatattgtg tacagaaaga cagagaggag 117060 
cagaaagaga gaacaoacaa aagaaaactc caggaaaaat aaaaaactat ttgcaaaaac 117120 
acacaaaact tgaoatttga cctcacttct agttatggga gttagaccag tggtgtgaco 117180 
accacaggaa cttatatgca aggtggtaat ttttacatga agaaoatatc aatgtgttgt 117240 
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gtgcaattaa aatgaattta ttttgatcat gcttgttatg cctactgtac agcataagac 117300 
atatgagtaa aaatactaaa ttgtatactt aagaatttgc tggggatctg ttccaagatg 117360 
gccaaatagg aacagctctg gtctgcagct cccagcatga tcgatgcaga agacaggtga 117420 
tttctgcatt tccaactgag gtgcctggtt catctcactg ggactggttg gacagtgggt 117480 
gcagcccatg gagggtgagc caaagcaggg cagggcatcg cctcacctgg ggagcacaaa 117540 
gsgttggggg atgtcccttt cctagccaag ggaagctggg gcacactgta cctgggacac 117600 
tcctgccaaa tactgcactt ttcccaagga tttagaaacg agcagacaag gagattctct 117660 
cccatgcctg gctcaggagg tctggagcct tgctcactgc tagcacagca gtctgaaatt 117720 
gaactgtgag gcagcagcct ggctagggga ggagcgtcca ccattgctga ggcatgagta 117780 
ggtaaacaaa gtggcccggc agctcaaact gggtggagcc cactgcagct cagcaaggcc 117840 
tactgcctct ataggctcca cctctgtggg caggacatag ctgaacaaaa ggcagcagac 117900 
agcttctgca gacttaaacg tccctgttcg acagctctga agagcacagt ggttcttcca 117960 
gcatggcgtt tgagctctga gaatggacag actgcctcct caagtgggtc cctgacacct 118020 
gtgtatgcta actgggagac atctcccagt aggggccaac agacacctca tataggtggg 118080 
tgcccctctg ggacaaagct tccagaggaa ggatcaggca gcaatgtttg ctgttctgca 118140 
atatttgctg ttcacagcct ctgctagtga tacccaggca aacagggtct ggagtggatc 118200 
tccagtaaac tccaacagac ctgcagctga gggacctgac tgttacaacg gaaactaaca 118260 
aacagaaagg aatagcacca acatcaacaa aaaggacatc tacaccaaaa ccccatctgt 118320 
aggtcaccaa tatcaaagac caaaggtaga taaaaccaca aagatgggga gaaaccagag 118380 
cggaaaagct gaaaattcta aaaatcagag cgcctcttct cctccaaagg actgcaactc 118440 
cccgccagca atggaacaaa gctggataga gaatgacttt gacgagttga cagaagtagg 118500 
cttcagaagg tcggtaataa caaactactc ctccgagcta aaggagcatg ttcaaaccca 118560 
ttgcaaggaa gctaaaaacc ttgaaaaatg gttagacgaa tggctaaata gaataaacag 118620 
tgtagagaag accttaaatg acctgaagga gctgaaaacc atggcacaag aacttcgtga 118680 
ggcatgcaca agcttcaata gccaattcaa tcaagtggaa gaaagggtgt cagtgattga 118740 
agatcaaatt aatgaaataa agcgagaaga catggttaga gaaaaaagat taaaaagaaa 118800 
caaacaaagc ctccaagaaa tatggtacta tgtgaaaaga ccaaatctac atttgattag 118860 
tgtacctgaa actgatgggg cgaacagaac caagttggaa cacactcttc atgatattat 118920 
ccagaacttg cccaaaatag caaggcaggc caacactcaa attcaggaaa tacagagaac 118980 
accacaaaga tactcctcga gaatagcaac cccaagacat gtagtcagat tcaccaaggt 119040 
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taaaatgaag gaaaaaatgt taacggcagc cagagagaaa ggtcaggtta cccacaaagg 119100 
gaagcccatc agactaacag cagatctctt ggcagaaacc ctacaagcca gaagacagtg 119160 
ggggccaata ttcaacattc ttaaagagaa gaattttcaa tccagaattc catatccagc 119220 
caaactaagt ttcataagtg aaggagaaat aaaatccttt acagaaaaac aaatgctgag 119280 
agattttgtc accaccaggc ctgccttaca agagctcctg aaggaagcac taaatatgga 119340 
aaggaacaac cagtaccagc cactacaaaa acatgccaaa ttgtaaagac cattgatgct 119400 
aggaagaaac tgcatcaact aacgggcaaa ataaccagct aacatcataa tgataggatc 119460 
aaattcacat ataacaatat taaccttaaa tgtaaatggg ctaaatgctc caattaaaag 119520 
acacacactg gcaaattgga taaagagtca ggacccatca gtgtgctgta ttcaggagac 119580 
tcatctcatg tgcagagata cacataggct caaaataaag ggatggagga agatctacca 119640 
agaaaatgga aagcaaaaaa aagcaggggt tgcaatccta gtgtctgata aaatagactt 119700 
taaaccaaca aagatcaaaa gagtcaaaga aggccactgc ataatggtaa agggatcaat 119760 
tcaacaagaa gagccaacta tcctaaatat atatgcaccc aacacaggag cacccagatt 119820 
cataaagcaa gtccttagag acctacagag agacttagac tccaacacaa taataatggg 119880 
agactttaac accccactgt caatattaga cagatcaacg agacagaagg ttaacaagga 119940 
tattcaggac ttgaactcag ctcagcacca agcagactta atagacatct acagaactct 120000 
tcaccccaaa acaacagaat atacattctt ttcagcaccg cattgcactt attccaaaat 120060 
tgactacata attggaagta aagcactcct cagcaagtgt aaaagaacag caatcacaac 120120 
aaactgtctc tcagaccaca gtgctatcaa attagacctc aggattaaga aactcactca 120180 
aaaccgcaca actacatgga aactgaataa cctgctcctg aatgactact gggtacatag 120240 
tgaaatgaag gcagaaataa agatgttctt tgaaaccaat aagaacaaag acactatgta 120300 
ccagattctc tgggacacat ttaaagtagt gtgtagaggg aaatttatag cactaaatgc 120360 
ccacaagaga aagcaggaga gatctaaaat tgacacccta acatcacaat taaaagaact 120420 
agagaagcaa gagcaaacaa attcaaaagc tagcagaagg caagaaataa ctaagatcag 120480 
agcagaactg aaggagatag agacacaaaa acccttcgaa aagtcaatga atccaggagc 120540 
tggttttctg aaaagatcaa caaaattgat agactgctag caagactaat aaagaagaaa 120600 
agagggaaga atcaaataga tgcaataaaa aataataatg tggatatcac taccaatccc 120660 
gcagaaatac aaactaccat cagagaatac tataaacacc tccatgcaaa taaactagaa 120720 
aatctagaag aaatggataa attcctggac acatacaccc tcccaagact aaaccaggaa 120780 
gaagttgaat ctctgaatag accattaaca ggctccaaaa ttgaggcaat aattaatagg 120840 

■ 

ctaccaacca aaaaaagtcc aggaccagac caattcacaa ctgaattcta ccaaaggtac 120900 
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aaagaggagt tggtaccatt ccttctgaaa ctattccaat caatagaaaa agagggaatc 120960 
ctccctaact cattttataa ggccagcatc atcctgatac caaagcctgg cagagacaaa 121020 
acaaaaaaaa gagaatttta gaccaatatc cttgatgaac atcgatgtga aaatcctcaa 121080 
taaagtactg gcaaactgaa tcctgcagca catcaaaaag cttatccacc atgatcaagt 121140 
cggcttcatc cccgggatgg aaggctggtt caacatatgc aaattgataa acgtaatcca 121200 
tcacataaac agaaccaatg acaaaaatca catgattatc tcaatagata caaaaaaggc 121260 
cttcaacaaa atccaacgcc cttcatgcta aaaactgtca ataaactagg tattgattga 121320 
acatatctca aaataataag agctatttat gacaaactca catccaatat catactgaat 121380 
gggcaaaaac tggaagcatt ccctttgaaa accggcacaa gacaaggatg ccctctctca 121440 
ccactcctat tcaacatagt gttggaagtt ctggccaggg caatcaggca aaaggaagca 121500 
ataaagcgta ttcaattagg aaaagaagaa gtcaaattgt ccctatttgt agaccacatg 121560 
attgtatatt tagaaaaccc cattgtctca gcccaaaatc tccttaagct gataagcaac 121620 
ttcaccaaag tctcaggata caaaatcaat gcgcaaaaat cacaagcatt cctatacatg 121680 
aataagagac aaacagagag ccaaatcatg agtgaattcc cattcacaat tgctacaaag 121740 
agaataaaat acctaagaat ccaacttaca agggatgtga aggacctttt caaggagaac 121800 
tacaaaccac tgcgcaagga aataaaagag gacacaaaca aatggaagaa cattccatgc 121860 
tcatggatag gaagaatcca tatagtgaaa atggccatac tgcccaaggt attttataga 121920 
ttcagtgcca tccccatcaa gctaccaatg actttcttca cagaattgga aaaaactact 121980 
ttaaagttca tatggagcca taaaagagcc tgcattgcca agacaatcct aagcaaaaag 122040 
aacaaagctg gagacatcaa gctacctgac ttcaaactgt agtacaaggc cacagtaacc 122100 
aaaacagcaa ggtacttgta ccaaaacaga gatatagacc aatggaacag aacagagccc 122160 
tcagacataa caccacactt ctacaaccat ctgatctttg acaaacctga caaaaacaag 122220 
aaatggggaa acgattccct atttaataaa tggtgctggg aaacctggct agccatatgg 122280 
agaaagctga cactggatcc cttccttaca ccttatacga aaattaactc aagatggatt 122340 
aaagacttaa atgttaaacc taaaaccata aaaaccctag aagaaaacct aggcaafcacc 122400 
attcaggaca taggcatggg caaagacttc atgactaaaa caccaaaagc aatggcaaca 122460 
aaagccaaaa ttgacaaatg ggatctaatt aaactaaaga gcttctgcac agcaaaagaa 122520 
actaccatca gagtgaatgg gcagcctaca gaatgggaga aaatttttgc aatctaccca 122580 
tctgaaaagg gctaatatcc agaatctaca aagaactcaa acaaatttac aagagaaaaa 122640 
acaaccccat caaaaagtgg gcaaaggata tgaacagaca cttctgaaaa gaagacatct 122700 
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atgcagccaa cagacacatg aaaaaatgct catcatcact gatcatcaga gaaatgcaaa 122760 
tcaaaaccac aatgaaaaac catctcatgc cagttagaat agtgaccatt aaaaagtcag 122820 
gaaacaacag gtgctggaga ggatgtggag aaataggaac acttttacac tgttggtggg 122880 
agtgtacatt agttcaacca ttgtggaaca cagtgtggcg attcctcaag gatctagaac 122940 
tagaaatact atttgaccca gcaatcttat tactggtata tacccaaagg attataaatc 123000 
atgctactat aaagacacat gcacacgtat gtttattgct attgcagcac tattcacaat 123060 
agcaaagact tggaaccaac ccaaatgtcc atcaatgata gactggatta agaaaatatg 123120 
gcacatatac accatggaat actatgcagc cataaaaaag gatgagtttg tgtcctttgc 123180 
agggacatgg atgaagctag aaaccatcat tctcagcaaa ctgtcacaaa gacagaaaac 123240 
caaacactgc atgttctcac tcataggtgg gaactgaaca acgaaaacac ttggacgcag 123300 
ggtggggaac atcagacacc ggggcctgtc agggagtgag gggctggggg agggatagcg 123360 
ttagcagaaa tagctaatgt aaatgacaag ttgatgggtg cagcaaacca acatggcaca 123420 
tgtataccta tgtatccaac ctgcacattg ttcacatgta caatgataat ttaaaaaaaa 123480 
gaatttgcta acaagataaa ccttatgtta agtgctcttg ccacaaaagg aaaaaactaa 123540 
acaaaacaat actaaagggg gacaggaaga agcctttgga ggtgatggat acatttatgg 123600 
cattgcttgt aatgatggtt acaggaatgt atacatatct ccagacacat taagtggtat 123660 
gcattggata tgtatagctt tttatatgcc aattatacat caataaagtg gtttagaagt 123720 
taattttaaa aatagaaaat atacttgttt cctgggtttc ttctcggtca tttggtgtgt 123780 
ctgttttgtg aagggttgag acctgtacca agttttctgg gcactgtaaa gccctagtta 123840 
ttatgatagg ggaggcagga aattgatgag gcaggggaga taggagagac atttcgtgac 123900 
ataggtatgc aaaaatgtgt caatgaagca tcaaattctg taaaatattt gtagagattt 123960 
attctgagtg aaatatcagt gacaatggac aaagccctca ggaggtcatg agaatatgtg 124020 
cccaaggtgg tcagggtgca gcttggtttt atacatttta gggcgacatg aaccttcaat 124080 
caaacacatt taagaaatac atagggttgg tccgaaatgg cagggttggg tagtggggga 124140 
tggggtgggc gaagcttcca gattatagtt agatttaaat tttttctggt tgacaattgg 124200 
ttgagtttat ctaaagacct gggatcaaca aaaaggaatg tctgagttaa gacaaagaat 124260 
tgtggagacc aaacttctta tttgcagggg aaacttttag gtttcctctt cagggagact 124320 
atgttgtaaa atgtttctca tcagacttaa agtctgtgtt gatgttaacg ctggagaggt 124380 
ataatgagac acacctgaac ccgcactgcc catcatggct tggaacagcc tctcaggtta 124440 
aatcttaaaa gagccttggc tgaggaggaa atccattcgg aaggttgggg ggccttataa 124500 
ctttattttt ggtttacatt catggatttg tgatacactt tttttttttt tttttttttt 124560 
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ttttagacag agtctcactc tgtcgcccgg gctagagcgc agtggcgcga tctcagctca 124620 
ctgcaacttc tgcctcccag gttcaagtga ttcttgtgcc tcagcctccc gagtagctgg 124680 
gattacaggt gtgcatcacc acgcctaatt tttgtatttt agtagagatg gggtttcacc 124740 
gtgttggcca ggctggtctc aaactcctgg gctcaagcaa tccacctfcgg ccttccaaag 124800 
tgctgggatt ataggcatga gacactgcac ctggcccgtg atacacttat ttatccatct 124860 
atttgtttat gcctagccga ctttagaaag ggttgaatgt ggtggacaaa aggacatcga 124920 
cgccaaagta ttttaaggtg aagaaatata agtgaaggaa aaataagtat aggaaattgc 124980 
aggagggaag tcccaggaaa cttgttagag gtacagactt gagacccata gccagagggt 125040 
gacatgatga attacatgtt tcaccatatt cactaggtta aaaaattagc tgtttgctcg 125100 
tggaaaatat gtctagttct gggagacact gtgtaatatc atcaagagtg tcctaaatta 125160 
tcagaaaggc agcaattaat attatagtgc ttttctaaag atgtaaaaca acttattccc 125220 
tcactacaaa attaagagag gctcaacata aggtgcttgg aaaatcctca tcatcagtag 125280 
cttctgggaa agatgaaagc ttagaaacac attagccagc tgccttcctc tgcccaggcc 125340 
caacgaaagt cactgaagaa atctaaaaac agggggagac ctgcatctgg aatgagagcc 125400 
aaggaaagaa gccatccaaa taccagaaac ttctaagaat tcccagcaca cgagggagca 125460 
gatgaaacac ggaagccaag tgtataatgt ataattcagc ccccgggaga actccctggc 125520 
cttgtagagg atgaactgat actcacaagt ttatagggtg agggcagccg gtgagaacgc 125580 
gacacgagag gttccaggaa gcccaacgcg ggagtcattt tagtaccgca gcttgtcaga 125640 
atttaagggc agctgagagg cagggttcgg agtttatgcg gctcagctga gagaaagccg 125700 
tagaaagaag agattccccg agagtggcag ggctctctgg gagagtcatg gagcagggag 125760 
aaaccaggag ggagtcaaat gtagctgtgc aggcacaagg caatggccgc ttaactgagg 125820 
ggaggccata ttttatttat tttttactgc tatttactga agcagtattt ttcttagcca 125880 
gcaactccag aaaagttgac aattttaaag gtcttcacca aaaatgaata tgagaataat 125940 
tagaaaaata tcaaaataat aaagagaaat atgcagtgtt atattaaaat gtaatataaa 126000 
actttgataa ttaaaacaat gctggaacca gaataacaca tataaatcat acaaaatttc 126060 
aaatcagttg gcaaaaaatg aattagtata cggtgttgaa atagctgtct ggcctttttt 126120 
cagaatatta gatcaccacc caccatagct aataacaaag taaattttag ttggattgaa 126180 
aaaattaaca cttcaaacat aaaaggagta aaagaatatt tgggtaacac cttcataatt 126240 
tcagtgaagg aaaggccttt ccaggaacca tatcaaaagc agaagcaaga aggaaaatat 126300 
agttttttac tacataaaaa tttaaattat ttatgtaaat aataaatatc cataaacaaa 126360 
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cttaaaaagc aaaattataa atggaaaata tttttatgta acaggcaaag aaaaatatcc 126420 
tttttataaa gaagtctggc aaagaaataa aaacattcta attgagtaaa tgggcaaagg 126480 
gcatgaacag actaacaaga aacaaagaaa cataaatgag accaggcgct gtggcccacg 126540 
ccggtaatcc cagcacttcg ggaggctgag gtgggtggat cacttgaggt cacaagtttg 126600 
agaccagcct ggccaacata gtgaaaccct gtctctacta aaaaaataca aaaattagct 126660 
gggcatggtg gcatgcacct gtaatcccgc tactcaagag gctgaggcac aagaatcgct 126720 
tgaacctggg aagcggaggt tgcggtgagt ggagatggca ccactgcact ctagccctct 126780 
agcattggca acagagcaag acttcaccag aaaaaaaaaa aaaaaaacaa caccataaat 126840 
gaatagaaaa tgtatagaaa tattcatttt gactactaat taaagaattg caaaattgaa 126900 
catttggata caaaatttta ccaaccacat aacaagatta aaaggaagaa ttatccaact 126960 
attaggagag tataaggaaa ttggtacttt tgttgttaca gtgagagcca taaatagtag 127020 
gcacttttga gggaaatgta acaatatttg tcaaaattta taaatcttct gtttctaaga 127080 
gtttatccta aggaataaca agaactgaga aaaatagatg aatataaatt atttataata 127140 
aaaaatggaa aacaacttag ataataatcg gctattgata aaattggagt acagtataac 127200 
caccggaaaa aatgttattt atcagtgctt ttccaattgc acttcccaca gtgaccccca 127260 
gagggctcat tttgctggcc tgctccaggg cagatctggg agctgctggc tgtctcagtt 127320 
tcaaccaaag atacctgagc tttcaacact ttcctatact gggttcgaga taaaattata 127380 
tttgaagaaa ggatcttact acctgaaaaa aaatgccact tgtacaagtt catatttctt 127440 
aacaggaaaa gatggtcata acaggctttt cagaggaaaa aaattaggtt acaaggtagg 127500 
atctgtatca tgatctcatt tctaaatgtg taatctatgt: attatatatg catgcgttat 127560 
caaaatttaa gcttgacgtt tctggttggt gccactatgg atttttttcc ttttcgctta 127620 
tttttatttg ctagtgtttc tataatcaat gtgtgtaact tatacgattt tataaaaagg 127680 
taaatgagtt acatggaaaa gaggaaggct aagcggcaga gaagtacaga aatggacaag 127740 
agcacatgga agagggagtc tgactttccg ggggtgagaa acatgagaca gtctacctta 127800 
tatgcaaacc tgcatttcat ctgttcgatc ctggatcctt cagccctgag aagtcactct 127860 
tcatggcaaa tgggctcagc tgcctggaag gccatcagtc tccctgctgt ggcagctgcg 127920 
tcacagccta agagagtggg gctttcacag tgtagcccca gcaccagtaa gagaaaaaca 127980 
gctccaagcc tataccctgc tgggcctttg gggaaacatc ttcggagacc ttggtagaat 128040 
tgattctaaa gtatgattct atactctctt ccctgtccat ccctctcaaa agtgaatgca 128100 
ctgttcatcc atcaagtgga aactttgtgg ggcccaggag tcacacgttg cccttaagag 128160 
tgccctggaa cctatagaag aagatgacat tagaattcta accattgaaa gaacaagagc 128220 
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acagggggca gctactaatg tcacccagga ccatagaaac ccatcaggct tcacaccccc 128280 
accatcaaga agaaccccct ccagtcctat tacctgccct cttcatccca ccctgcttct 128340 
ttcttaatct gaactcccac tgcttctctc cacgcctggt ttttctctgt tcctttaggg 128400 
accagctccc ctatgccatc aaactcctaa tgggaggctc cctccacctc ctgcctcctg 128460 
aggtggctga ttcccttaag acatctcaag aggatgctcc tcaaccttcc tttttaggta 128520 
cctcatggcc tggtggtggt gaagacattt tcctaccgtt ctagaccacc tccaggcctt 128580 
ttctccttca gtattgcata cgcatgccct ctctgtttct ctgatgctca tgctatcagt 128640 
ctagactacc ttctacaaca cctccccata tgtctgtgcc catatggcca tggccctcat 128700 
ccagtgaaga tgttgacact attctcagtc tctcaccttt catactgtcc taatcacagg 128760 
tgactcgatg tctacataaa ctaagtctag tgagagggct tcacatgatg actcacagag 128820 
cttgacatgt tccctgcatt tgaaagcatt gtggaccaat gactacatca cacctgagaa 128880 
tgttgaagag aagctggctg gcgctgtggt cctagcccag ggggctgatg taaaacctgc 128940 
atcatgaagt ttgcctgagg ctgtgttaaa aaagatcctg ccccagacat ttctgccaga 129000 
gaaagaggcc tccctga 129017 

<210> 2 

<211> 1567 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> CDS 

<222> (129) . . (1241) 



<400> 2 

gggctcaggg agggctctgt gcctccgttc agcagagctg cagctgctgc ccagctctca 60 

ggaggcaagc tggactccct cactcagctg caggagcaag gacagtgagg ctcaaccccg 120 

cctgagcc atg cca gcc aac ttc aca gag ggc age ttc gat tec agt ggg 170 
Met Pro Ala Asn Phe Thr Glu Gly Ser Phe Asp Ser Ser Gly 
15 10 

acc ggg cag acg ctg gat tct tec cca gtg get tge act gaa aca gtg 218 
Thr Gly Gin Thr Leu Asp Ser Ser Pro Val Ala Cys Thr Glu Thr Val 
15 20 25 30 
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act ttt act gaa gtg gtg gaa gga aag gaa tgg ggt tec ttc tac tac 266 
Thr Phe Thr Glu Val Val Glu Gly Lys Glu Trp Gly Ser Phe Tyr Tyr 

35 40 45 

tec ttt aag act gag caa ttg ata act ctg tgg gtc etc ttt gtt ttt 314 
Ser Phe Lys Thr Glu Gin Leu lie Thr Leu Trp Val Leu Phe Val Phe 

50 55 60 

acc att gtt gga aac tec gtt gtg ctt ttt tec aca tgg agg aga aag 362 
Thr He Val Gly Asn Ser Val Val Leu Phe Ser Thr Trp Arg Arg Lys 
65 70 75 

aag aag tea aga atg acc ttc ttt gtg act cag ctg gee ate aca gat 410 
Lys Lys Ser Arg Met Thr Phe Phe Val Thr Gin Leu Ala He Thr Asp 
80 85 90 

tct ttc aca gga ctg gtc aac ate ttg aca gat att aat tgg cga ttc 458 
Ser Phe Thr Gly Leu Val Asn He Leu Thr Asp He Asn Trp Arg Phe 
95 100 105 110 

act gga gae ttc aeg gca cct gac ctg gtt tge cga gtg gtc cgc tat 506 
Thr Gly Asp Phe Thr Ala Pro Asp Leu Val Cys Arg Val Val Arg Tyr 

115 120 125 

ttg cag gtt gtg ctg etc tac gee tct acc tac gtc ctg gtg tec etc 554 
Leu Gin Val Val Leu Leu Tyr Ala Ser Thr Tyr Val Leu Val Ser Leu 

130 135 140 



age ata gac aga tac eat gcc ate gtc tac ccc atg aag ttc ctt caa 
Ser He Asp Arg Tyr His Ala He Val Tyr Pro Met Lys Phe Leu Gin 
145 150 155 



ttt ctg ttc tec att ccc acc ctg ate ata ttt ggg aag agg aca ctg 
Phe Leu Phe Ser He Pro Thr Leu He He Phe Gly Lys Arg Thr Leu 
175 180 185 190 



602 



gga gaa aag caa gee agg gtc etc att gtg ate gcc tgg age ctg tct 650 
Gly Glu Lys Gin Ala Arg Val Leu He Val He Ala Trp Ser Leu Ser 
160 165 170 



698 



tec aac ggt gaa gtg cag tge tgg gee ctg tgg cct gac gac tee tac 746 
Ser Asn Gly Glu Val Gin Cys Trp Ala Leu Trp Pro Asp Asp Ser Tyr 

195 200 205 

tgg acc cca tac atg acc ate gtg gcc ttc ctg gtg tac ttc ate cct 794 
Trp Thr Pro Tyr Met Thr He Val Ala Phe Leu Val Tyr Phe He Pro 

210 215 220 

ctg aca ate ate age ate atg tat ggc att gtg ate cga act att tgg 842 
Leu Thr He He Ser He Met Tyr Gly He Val He Arg Thr He Trp 
225 230 235 

att aaa age aaa ace tac gaa aca gtg att tec aac tge tea gat ggg 890 
He Lys Ser Lys Thr Tyr Glu Thr Val He Ser Asn Cys Ser Asp Gly 
240 245 250 

aaa ctg tge age age tat aac cga gga etc ate tea aag gca aaa ate 938 
Lys Leu Cys Ser Ser Tyr Asn Arg Gly Leu He Ser Lys Ala Lys He 
255 260 265 270 

aag get ate aag tat age ate ate ate att ctt gcc ttc ate tge tgt 986 
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Lys Ala He Lys Tyr Ser He He He He Leu Ala Phe He Cys Cys 

275 280 285 

tgg agt cca tac ttc ctg ttt gac att ttg gac aat ttc aac etc ctt 
Trp Ser Pro Tyr Phe Leu Phe Asp He Leu Asp Asn Phe Asn Leu Leu 

290 295 300 

cca gac acc cag gag cgt ttc tat gcc tct gtg ate att cag aac ctg 
Pro Asp Thr Gin Glu Arg Phe Tyr Ala Ser Val He He Gin Asn Leu 
305 310 315 

cca gca ttg aat agt gcc ate aac ccc etc ate tac tgt gtc ttc age 
Pro Ala Leu Asn Ser Ala He Asn Pro Leu He Tyr Cys Val Phe Ser 
320 325 330 

age tec ate tct ttc ccc tgc agg gag caa aga tea cag gat tec aga 
Ser Ser He Ser Phe Pro Cys Arg Glu Gin Arg Ser Gin Asp Ser Arg 
335 340 345 350 

atg acg ttc egg gag aga act gag agg cat gag atg cag att ctg tec 
Met Thr Phe Arg Glu Arg Thr Glu Arg His Glu Met Gin He Leu Ser 

355 360 365 

aag cca gaa ttc ate tagaccctag ggcagtgcca gtgctaggct gagcaccatc 
Lys Pro Glu Phe He 

370 

agetctccca ggtccttgtc acctgettgg gcacgtgcat ggaacccgag ccaacttcac 
cecacccteg teattacctg ggagatgcac aagacaaatg ttctaatgae tgeatgcact 
gettaagtat tggccaacac gaactcccca gttattcatg ccagccagga aggaaaegcc 
ttccttcccc accattccea gecctccttc ecactggcea geacctgaac ccagtgaaca 
caggcattag tggtccaggg teetggcttg gagecagtga gtagac 



1034 



1082 



1130 



1178 



1226 



1281 



1341 
1401 
1461 
1521 
1567 



<210> 3 



<211> 371 



<212> PRT 



<213> Homo sapiens 



<400> 3 



Met Pro Ala Asn Phe Thr Glu Gly Ser Phe Asp Ser Ser Gly Thr Gly 
^ . 5 10 15 



Gin Thr Leu Asp Ser Ser Pro Val Ala Cys Thr Glu Thr Val Thr Phe 

20 25 30 



Thr Glu Val Val Glu Gly Lys Glu Trp Gly Ser Phe Tyr Tyr Ser Phe 
35 40 45 
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Lys Thr Glu Gin Leu lie Thr Leu Trp Val Leu Phe Val Phe Thr He 
50 55 60 



Val Qly Asn Ser Val Val Leu Phe Ser Thr Trp Arg Arg Lys Lys Lys 
^5 70 75 eo 



Ser Arg Met Thr Phe Phe Val Thr Gin Leu Ala He Thr Asp Ser Phe 

85 90 95 



Thr Gly Leu Val Asn He Leu Thr Asp He Asn Trp Arg Phe Thr Gly 

100 105 110 



Asp Phe Thr Ala Pro Asp Leu Val Cys Arg Val Val Arg Tyr Leu Gin 
115 120 125 



Val Val Leu Leu Tyr Ala Ser Thr Tyr Val Leu Val Ser Leu Ser He 
130 135 140 



Asp Arg Tyr His Ala He Val Tyr Pro Met Lys Phe Leu Gin Gly Glu 
^45 150 155 160 



Lys Gin Ala Arg Val Leu He Val He Ala Trp Ser Leu Ser Phe Leu 

165 170 175 



Phe Ser He Pro Thr Leu He He Phe Gly Lys Arg Thr Leu Ser Asn 

180 185 190 



Gly Glu Val Gin Cys Trp Ala Leu Trp Pro Asp Asp Ser Tyr Trp Thr 
195 200 205 



Pro Tyr Met Thr He Val Ala Phe Leu Val Tyr Phe He Pro Leu Thr 
210 215 220 



He He Ser He Met Tyr Gly He Val He Arg Thr He Trp He Lys 
225 230 235 240 



Ser Lys Thr Tyr Glu Thr Val He Ser Asn Cys Ser Asp Gly Lys Leu 

245 250 255 



Cys Ser Ser Tyr Asn Arg Gly Leu He Ser Lys Ala Lys He Lys Ala 

260 265 270 



He Lys Tyr Ser He He He He Leu Ala Phe He Cys Cys Trp Ser 

275 280 285 
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Pro Tyr Phe Leu Phe Asp He Leu Asp Asn Phe Asn Leu Leu Pro Asp 
290 295 300 

Thr Gin Glu Arg Phe Tyr Ala Ser Val He He Gin Asn Leu Pro Ala 
305 310 315 320 

Leu Asn Ser Ala He Asn Pro Leu He Tyr Cys Val Phe Ser Ser Ser 

325 330 335 

He Ser Phe Pro Cys Arg Glu Gin Arg Ser Gin Asp Ser Arg Met Thr 

340 345 350 

Phe Arg Glu Arg Thr Glu Arg His Glu Met Gin He Leu Ser Lys Pro 
355 360 365 

Glu Phe He 
370 



<210> 4 

<211> 1410 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> CDS 

<222> (129) . . (1259) 



<400> 4 

gggctcaggg agggctctgt gcctccgttc agcagagctg cagctgctgc ccagctctca 60 

ggaggcaagc tggactccct cactcagctg caggagcaag gacagtgagg ctcaaccccg 120 

cctgagcc atg cca gcc aac ttc aca gag ggc age ttc gat tec agt ggg 170 
Met Pro Ala Asn Phe Thr Glu Gly Ser Phe Asp Ser Ser Gly 
15 10 

acc ggg cag acg ctg gat tct tec cea gtg get tgc act gaa aca gtg 218 
Thr Gly Gin Thr Leu Asp Ser Ser Pro Val Ala Cys Thr Glu Thr Val 
15 20 25 30 

act ttt act gaa gtg gtg gaa gga aag gaa tgg ggt tee ttc tae tac 266 
Thr Phe Thr Glu Val Val Glu Gly Lys Glu Trp Gly Ser Phe Tyr Tyr 

35 40 45 

tee ttt aag act gag eaa ttg ata act ctg tgg gtc etc ttt gtt ttt 314 
Ser Phe Lys Thr Glu Gin Leu He Thr Leu Trp Val Leu Phe Val Phe 
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50 55 60 

acc att gtt gga aac tec gtt gtg ctt ttt tec aca tgg agg aga aag 362 
Thr lie Val Gly Asn Ser Val Val Leu Phe Ser Thr Trp Arg Arg Lys 
65 70 75 

aag aag tea aga atg acc ttc ttt gtg act cag ctg gcc ate aca gat 410 
Lys Lys Ser Arg Met Thr Phe Phe Val Thr Gin Leu Ala He Thr Asp 
80 85 90 

tct ttc aca gga ctg gtc aac ate ttg aca gat att aat tgg cga ttc 458 
Ser Phe Thr Gly Leu Val Asn He Leu Thr Asp He Asn Trp Arg Phe 
55 100 105 110 



act gga gac ttc acg gca cct gac ctg gtt tgc cga gtg gtc cgc tat 
Thr Gly Asp Phe Thr Ala Pro Asp Leu Val Cys Arg Val Val Arg Tyr 

115 120 125 



age ata gac aga tac eat gee ate gtc tac cec atg aag ttc ctt caa 
Ser lie Asp Arg Tyr His Ala He Val Tyr Pro Met Lys Phe Leu Gin 
145 150 155 



aag get ate aag tat age ate ate ate att ctt gcc ttc ate tgc tgt 
Lys Ala He Lys Tyr Ser He He He He Leu Ala Phe He Cys Cys 

275 280 285 



506 



ttg cag gtt gtg ctg etc tac gcc tct acc tac gtc ctg gtg tec etc 554 
Leu Gin Val Val Leu Leu Tyr Ala Ser Thr Tyr Val Leu Val Ser Leu 

130 135 140 



602 



gga gaa aag caa gee agg gtc etc att gtg ate gee tgg age ctg tct 650 
Gly Glu Lys Gin Ala Arg Val Leu He Val He Ala Trp Ser Leu Ser 
160 165 170 

ttt ctg ttc tec att cec acc ctg ate ata ttt ggg aag agg aca ctg 698 
Phe Leu Phe Ser He Pro Thr Leu He He Phe Gly Lys Arg Thr Leu 
175 180 185 190 

tec aac ggt gaa gtg cag tgc tgg gee ctg tgg cct gac gac tec tac 746 
Ser Asn Gly Glu Val Gin Cys Trp Ala Leu Trp Pro Asp Asp Ser Tyr 

195 200 205 

tgg acc cca tac atg ace ate gtg gcc ttc ctg gtg tac ttc ate cct 794 
Trp Thr Pro Tyr Met Thr He Val Ala Phe Leu Val Tyr Phe He Pro 

210 215 220 

ctg aca ate ate age ate atg tat ggc att gtg ate cga act att tgg 842 
Leu Thr He He Ser He Met Tyr Gly He Val He Arg Thr He Trp 
225 230 235 

att aaa age aaa acc tac gaa aca gtg att tec aac tgc tea gat ggg 890 
He Lys Ser Lys Thr Tyr Glu Thr Val He Ser Asn Cys Ser Asp Gly 
240 245 250 

aaa ctg tgc age age tat aac cga gga etc ate tea aag gca aaa ate 938 
Lys Leu Cys Ser Ser Tyr Asn Arg Gly Leu He Ser Lys Ala Lys He 
255 260 265 270 



986 



tgg agt cca tac ttc ctg ttt gac att ttg gac aat ttc aac etc ett 1034 
Trp Ser Pro Tyr Phe Leu Phe Asp He Leu Asp Asn Phe Asn Leu Leu 

290 295 300 
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cca gac acc cag gag cgt ttc tat gcc tct gtg ate att cag aac ctg 
Pro Asp Thr Gin Glu Arg Phe Tyr Ala Ser Val lie lie Gin Asn Leu 
305 310 315 



1082 



cca gca ttg aat agt gcc ate aac ccc etc ate tac tgt gtc ttc age 
Pro Ala Leu Asn Ser Ala He Asn Pro Leu He Tyr Cys Val Phe Ser 
320 325 330 



1130 



age tee ate tct ttc ccc tge agg gtc ate cgt etc cgt cag etc cag 1178 
Ser Ser He Ser Phe Pro Cys Arg Val He Arg Leu Arg Gin Leu Gin 
335 340 345 350 

gag get gcg eta atg etc tge cct caa ega gag aac tgg aag ggt act 1226 
Glu Ala Ala Leu Met Leu Cys Pro Gin Arg Glu Asn Trp Lys Gly Thr 

355 360 365 

tgg cca ggt gta cct tec tgg get ctt cca agg tgaeagctct caeectgtgc 1279 
Trp Pro Gly Val Pro Ser Trp Ala Leu Pro Arg 

370 375 

tgcaggtggc cctgtgcctg gtgeeactte teactgetta ceagggcaca aggacaccag 1339 
tggttcccaa aatgggtcac agcaggatgg ectgeatcag atteaecagg gagggctata 1399 



agaaggcaga e 



1410 



<210> 5 

<211> 377 

<212> PRT 

<213> Homo sapiens 



<400> 5 

Met Pro Ala Asn Phe Thr Glu Gly Ser Phe Asp Ser Ser Gly Thr Gly 
1 5 10 15 



Gin Thr Leu Asp Ser Ser Pro Val Ala Cys Thr Glu Thr Val Thr Phe 

20 25 30 



Thr Glu Val Val Glu Gly Lys Glu Trp Gly Ser Phe Tyr Tyr Ser Phe 
35 40 45 



Lys Thr Glu Gin Leu He Thr Leu Trp Val Leu Phe Val Phe Thr He 
50 55 60 



Val Gly Asn Ser Val Val Leu Phe Ser Thr Trp Arg Arg Lys Lys Lys 
65 70 75 



80 



Ser Arg Met Thr Phe Phe Val Thr Gin Leu Ala He Thr Asp Ser Phe 



wo 2004/056866 

85 



79/111 
90 
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95 



Thr Gly Leu Val Asn lie Leu Thr Asp He Asn Trp Arg Phe Thr Gly 

100 105 110 



Asp Phe Thr Ala Pro Asp Leu Val Cys Arg Val Val Arg Tyr Leu Gin 
115 120 125 



Val Val Leu Leu Tyr Ala Ser Thr Tyr Val Leu Val Ser Leu Ser He 

130 135 140 



Asp Arg Tyr His Ala He Val Tyr Pro Met Lys Phe Leu Gin Gly Glu 
145 150 155 160 



Lys Gin Ala Arg Val Leu He Val He Ala Trp Ser Leu Ser Phe Leu 

165 170 175 



Phe Ser He Pro Thr Leu He He Phe Gly Lys Arg Thr Leu Ser Asn 

180 185 190 



Gly Glu Val Gin Cys Trp Ala Leu Trp Pro Asp Asp Ser Tyr Trp Thr 
195 200 205 



Pro Tyr Met Thr He Val Ala Phe Leu Val Tyr Phe He Pro Leu Thr 
210 215 220 



He He Ser He Met Tyr Gly He Val He Arg Thr He Trp He Lys 
225 230 235 240 



Ser Lys Thr Tyr Glu Thr Val He Ser Asn Cys Ser Asp Gly Lys Leu 

245 250 255 



Cys Ser Ser Tyr Asn Arg Gly Leu He Ser Lys Ala Lys He Lys Ala 

260 265 270 



He Lys Tyr Ser He He He He Leu Ala Phe He Cys Cys Trp Ser 
275 280 285 



Pro Tyr Phe Leu Phe Asp He Leu Asp Asn Phe Asn Leu Leu Pro Asp 
290 295 300 



Thr Gin Glu Arg Phe Tyr Ala Ser Val He He Gin Asn Leu Pro Ala 
305 310 315 320 



Leu Asn Ser Ala He Asn Pro Leu He Tyr Cys Val Phe Ser Ser Ser 

325 330 335 
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He Ser Phe Pro Cys Arg Val He Arg Leu Arg Gin Leu Gin Glu Ala 

340 345 350 



Ala Leu Met Leu Cys Pro Gin Arg Glu Asn Trp Lys Gly Thr Trp Pro 
355 360 365 



Gly Val Pro Ser Trp Ala Leu Pro Arg 
370 375 



<210> 6 

<211> 1377 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> CDS 

<222> (129) . . (1226) 



<400> 6 

gggctcaggg agggctctgt gcctccgttc agcagagctg cagctgctgc ccagctctca 60 

ggaggcaagc tggactccct cactcagctg caggagcaag gacagtgagg ctcaaccccg 120 

cctgagcc atg cca gcc aac ttc aca gag ggc age ttc gat tec agt ggg 170 
Met Pro Ala Asn Phe Thr Glu Gly Ser Phe TVsp Ser Ser Gly 
15 10 

acc ggg cag acg ctg gat tct tec cca gtg get tgc act gaa aca gtg 218 
Thr Gly Gin Thr Leu Asp Ser Ser Pro Val Ala Cys Thr Glu Thr Val 
15 20 25 30 

act ttt act gaa gtg gtg gaa gga aag gaa tgg ggt tec ttc tac tac 266 
Thr Phe Thr Glu Val Val Glu Gly Lys Glu Trp Gly Ser Phe Tyr Tyr 

35 40 45 

tec ttt aag act gag caa ttg ata act ctg tgg gtc etc ttt gtt ttt 314 
Ser Phe Lys Thr Glu Gin Leu He Thr Leu Trp Val Leu Phe Val Phe 

50 55 60 

acc att gtt gga aac tec gtt gtg ctt ttt tec aca tgg agg aga aag 362 
Thr He Val Gly Asn Ser Val Val Leu Phe Ser Thr Trp Arg Arg Lys 
65 70 75 

aag aag tea aga atg acc ttc ttt gtg act cag ctg gcc ate aca gat 410 
Lys Lye Ser Arg Met Thr Phe Phe Val Thr Gin Leu Ala He Thr Asp 
80 85 90 
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att aat tgg cga ttc act gga gac ttc acg gca cct gac ctg gtt tgc 458 

lie Aen Trp Arg Phe Thr Gly Asp Phe Thr Ala Pro Asp Leu Val Cys 

55 100 105 110 

cga gtg gtc cgc tat ttg cag gtt gtg ctg etc tac gcc tct acc tac 506 
Arg Val Val Arg Tyr Leu Gin Val Val Leu Leu Tyr Ala Ser Thr Tyr 

115 120 125 

gtc ctg gtg tec etc age ata gac aga tac cat gcc ate gtc tac ccc 554 
Val Leu Val Ser Leu Ser He Asp Arg Tyr His Ala He Val Tyr Pro 

130 135 140 

atg aag ttc ett caa gga gaa aag caa gcc agg gtc etc att gtg ate 602 
Met Lys Phe Leu Gin Gly Glu Lys Gin Ala Arg Val Leu He Val He 
145 150 155 



gcc tgg age ctg tct ttt ctg ttc tec att ccc acc ctg ate ata ttt 
Ala Trp Ser Leu Ser Phe Leu Phe Ser He Pro Thr Leu He He Phe 
160 165 170 



cct gac gac tec tac tgg ace cea tac atg ace ate gtg gee ttc ctg 
Pro Asp Asp Ser Tyr Trp Thr Pro Tyr Met Thr He Val Ala Phe Leu 

195 200 205 



ate att cag aac ctg cea gca ttg aat agt gcc ate aac ccc etc ate 
He He Gin Asn Leu Pro Ala Leu Asn Ser Ala He Asn Pro Leu He 
305 310 315 



650 



ggg aag agg aca ctg tec aac ggt gaa gtg cag tgc tgg gcc ctg tgg 698 
Gly Lys Arg Thr Leu Ser Asn Gly Glu Val Gin Cys Trp Ala Leu Trp 
175 180 185 190 



746 



gtg tac ttc ate cct ctg aca ate ate age ate atg tat gge att gtg 794 
Val Tyr Phe He Pro Leu Thr He He Ser He Met Tyr Gly He Val 

210 215 220 

ate cga act att tgg att aaa age aaa acc tac gaa aca gtg att tec 842 
He Arg Thr He Trp He Lys Ser Lys Thr Tyr Glu Thr Val He Ser 
225 230 235 

aac tgc tea gat ggg aaa ctg tgc age age tat aac cga gga etc ate 890 
Asn Cys Ser Asp Gly Lys Leu Cys Ser Ser Tyr Asn Arg Gly Leu He 
240 245 250 

tea aag gca aaa ate aag get ate aag tat age ate ate ate att ett 938 
Ser Lys Ala Lys He Lys Ala He Lys Tyr Ser He He He He Leu 
255 260 265 270 

gcc ttc ate tgc tgt tgg agt cea tac ttc ctg ttt gac att ttg gac 986 
Ala Phe He Cys Cys Trp Ser Pro Tyr Phe Leu Phe Asp He Leu Asp 

275 280 285 

aat ttc aac etc ett cea gac acc cag gag cgt ttc tat gcc tct gtg 1034 
Asn Phe Asn Leu Leu Pro Asp Thr Gin Glu Arg Phe Tyr Ala Ser Val 

290 295 300 



1082 



tac tgt gtc ttc age age tee ate tct ttc ccc tgc agg gtc ate cgt 1130 
Tyr Cys Val Phe Ser Ser Ser He Ser Phe Pro Cys Arg Val He Arg 
320 325 330 

etc cgt cag etc cag gag get geg eta atg etc tgc cct caa cga gag 1178 
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Leu Arg Gin Leu Gin Glu Ala Ala Leu Met Leu Cya Pro Gin Arg Glu 
335 340 345 350 

aac tgg aag ggt act tgg cca ggt gta cct tec tgg get ctt cca agg 1226 
Asn Trp Lys Qly Thr Trp Pro Gly Val Pro Ser Trp Ala Leu Pro Arg 

355 360 365 

tgacagctct caccctgtgc tgcaggtggc cctgtgcctg gtgccacttc tcactgctta 1286 
ccagggcaca aggacaccag tggttcccaa aatgggtcac agcaggatgg cctgcatcag 1346 
attcaccagg gagggctata agaaggcaga c 1377 

<210> 7 

<211> 366 

<212> PRT 

<213> Homo sapiens 



<400> 7 

Met Pro Ala Asn Phe Thr Glu Gly Ser Phe Asp Ser Ser Gly Thr Gly 
1 5 • 10 15 



Gin Thr Leu Asp Ser Ser Pro Val Ala Cys Thr Glu Thr Val Thr Phe 

20 25 30 



Thr Glu Val Val Glu Gly Lys Glu Trp Gly Ser Phe Tyr Tyr Ser Phe 
35 40 45 



Lys Thr Glu Gin Leu lie Thr Leu Trp Val Leu Phe Val Phe Thr He 
50 55 60 



Val Gly Asn Ser Val Val Leu Phe Ser Thr Trp Arg Arg Lys Lys Lys 
65 70 75 80 



Ser Arg Met Thr Phe Phe Val Thr Gin Leu Ala He Thr Asp He Asn 

85 90 95 



Trp Arg Phe Thr Gly Asp Phe Thr Ala Pro Asp Leu Val Cys Arg Val 

100 105 110 



Val Arg Tyr Leu Gin Val Val Leu Leu Tyr Ala Ser Thr Tyr Val Leu 
115 120 125 



Val Ser Leu Ser He Asp Arg Tyr His Ala He Val Tyr Pro Met Lys 
130 135 140 
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Phe Leu Gin Gly Glu Lys Gin Ala Arg Val Leu He Val He Ala Trp 
145 150 155 160 



Ser Leu Ser Phe Leu Phe Ser He Pro Thr Leu He He Phe Gly Lya 

165 170 175 



Arg Thr Leu Ser Asn Gly Glu Val Gin Cys Trp Ala Leu Trp Pro Asp 

180 185 190 



Asp Ser Tyr Trp Thr Pro Tyr Met Thr He Val Ala Phe Leu Val Tyr 
195 200 205 



Phe He Pro Leu Thr He He Ser He Met Tyr Gly He Val He Arg 
210 215 220 



Thr He Trp He Lys Ser Lys Thr Tyr Glu Thr Val ile Ser Asn Cys 
225 230 235 240 



Ser Asp Gly Lys Leu Cys Ser Ser Tyr Asn Arg Gly Leu He Ser Lys 

245 250 255 



Ala Lys He Lys Ala He Lys Tyr Ser He He He He Leu Ala Phe 

260 265 270 



He Cys Cys Trp Ser Pro Tyr Phe Leu Phe Asp He Leu Asp Asn Phe 
275 280 285 



Asn Leu Leu Pro Asp Thr Gin Glu Arg Phe Tyr Ala Ser Val He He 
290 295 300 



Gin Asn Leu Pro Ala Leu Asn Ser Ala He Asn Pro Leu He Tyr Cys 
305 310 315 320 



Val Phe Ser Ser Ser He Ser Phe Pro Cys Arg Val He Arg Leu Arg 

325 330 335 



Gin Leu Gin Glu Ala Ala Leu Met Leu Cys Pro Gin Arg Glu Asn Trp 

340 345 350 



Lys Gly Thr Trp Pro Gly Val Pro Ser Trp Ala Leu Pro Arg 
355 360 365 



<210> 8 



<211> 814 
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<220> 

<22X> CDS 

<222> (129) . . (410) 



<400> 8 

gggctcaggg agggctctgt gcctccgttc agcagagctg cagctgctgc ccagctctca 60 

ggaggcaagc tggactccct cactcagctg caggagcaag gacagtgagg ctcaaccccg 120 

cctgagcc atg cca gcc aac ttc aca gag ggc age ttc gat tec agt ggg 170 
Met Pro Ala Asn Phe Thr Glu Gly Ser Phe Asp Ser Ser Gly 
1 5 10 

acc ggg cag acg ctg gat tct tec cca gtg get tge act gaa aca gtg 218 
TUT Gly Gin Thr Leu Asp Ser Ser Pro Val Ala Cys Thr Glu Thr Val 
" 20 25 30 

Sf^^ *^9Sf ggt tec ttc tac tac 266 
Thr Phe Tlfr Glu Val Val Glu Gly Lys Glu Trp Gly Ser Phe Tyr Tyr 

35 40 45 

tec ttt aag act gag caa ttg ata act ctg tgg gtc etc ttt gtt ttt 314 
Ser Phe Lys Thr Glu Gin Leu He Thr Leu Trp Val Leu Phe Val Phe 

50 55 60 

acc att gtt gga aac tec gtt gtg ett ttt tec aca tgg agg aga aag 362 
Thr He Val Gly Asn Ser Val Val Leu Phe Ser Thr Trp Arg Arg Lys 
65 70 75 



aag aag tea aga atg ace ttc ttt gtg act cag ctg gcc ate aca gta 
Lys Lys Ser Arg Met Thr Phe Phe Val Thr Gin Leu Ala He Thr Val 
80 85 90 



410 



taacaagccc 


acctgcttga 


gctgggctgc 


agtggceagg gtaaacatec aaggcaccag 


470 


tgaaaaatac 


agagaaggta 


aaaggagcaa 


gagttctgaa gatggaacct gggatggggg 


530 


aaagtttctt 


caatctttcc 


taceaacaag 


aactccaatt tttcactcct ataaccgtag 


590 


aagtagaggt 


aattaggate 


atccagcaaa 


tgcttagagg caaatatcce tggatgagga 


650 


tgccacagct 


tattttcatt 


atatttcttc 


gattacagtg tggtaatgca tgttgtatgg 


710 


aactacatat 


tctttcagaa 


tgaaaggatt 


tagaggtggc aagaatatca gcttgaaatt 


770 


taaagttttt 


teataaacaa 


taaacaaatg 


ataattgaaa attc 


814 



<210> 9 
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<211> 94 

<212> PRT 

<213> Homo sapiens 

<400> 9 

Met Pro Ala Asn Phe Thr Glu Gly Ser Phe Asp Ser Ser Gly Thr Gly 
15 10 15 

Gin Thr Leu Asp Ser Ser Pro Val Ala Cys Thr Glu Thr Val Thr Phe 

20 25 30 

Thr Glu Val Val Glu Gly Lys Glu Trp Gly Ser Phe Tyr Tyr Ser Phe 
35 40 45 

Lys Thr Glu Gin Leu lie Thr Leu Trp Val Leu Phe Val Phe Thr lie 
50 55 60 

Val Gly Asn Ser Val Val Leu Phe Ser Thr Trp Arg Arg Lys Lys Lys 
^5 70 75 80 

Ser Arg Met Thr Phe Phe Val Thr Gin Leu Ala lie Thr Val 

85 90 

<210> 10 

<211> 1463 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> CDS 

<222> (129) . . (602) 

<400> 10 

gggctcaggg agggctctgt gcctccgttc agcagagctg cagctgctgc ccagctctca 60 

ggaggcaagc tggactccct cactcagctg caggagcaag gacagtgagg ctcaaccccg 120 

cctgagcc atg cca gcc aac ttc aca gag ggc age ttc gat tec agt ggg 170 
Met Pro Ala Asn Phe Thr Glu Gly Ser Phe Asp Ser Ser Gly 
15 10 
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acc ggg cag acg ctg gat tct tec cca gtg get tgc act gaa aca gtg 218 
Thr Gly Gin Thr Leu Asp Ser Ser Pro Val Ala Cys Thr Glu Thr Val 
15 20 25 30 

act ttt act gaa gtg gtg gaa gga aag gaa tgg ggt tec ttc tac tac 266 
Thr Phe Thr Glu Val ,Val Glu Gly Lys Glu Trp Gly Ser Phe Tyr Tyr 

35 40 45 

tec ttt aag act gag caa ttg ata act ctg tgg gtc etc ttt gtt ttt 314 
Ser Phe Lys Thr Glu Gin Leu lie Thr Leu Trp Val Leu Phe Val Phe 

50 55 60 

acc att gtt gga aac tec gtt gtg ett ttt tec aca tgg agg aga aag 362 
Thr He Val Gly Asn Ser Val Val Leu Phe Ser Thr Trp Arg Arg Lys 
65 70 75 

aag aag tea aga atg acc ttc ttt gtg act cag ctg gee ate aca ggt 410 
Lys Lys Ser Arg Met Thr Phe Phe Val Thr Gin Leu Ala He Thr Gly 
80 85 90 

tgt get get eta cgc etc tac eta cgt cet ggt gtc ect cag eat aga 458 
Cys Ala Ala Leu Arg Leu Tyr Leu Arg Pro Gly Val Pro Gin His Arg 
95 100 105 110 

cag ata cca tgc cat cgt eta ecc eat gaa gtt ect tea agg aga aaa 506 
Gin He Pro Cys His Arg Leu Pro His Glu Val Pro Ser Arg Arg Lys 

115 120 125 

gea age cag ggt ect cat tgt gat cgc ctg gag cet gtc ttt tct gtt 554 
Ala Ser Gin Gly Pro His Cys Asp Arg Leu Glu Pro Val Phe Ser Val 

130 135 140 

etc cat tee eac ect gat eat att tgg gaa gag gae act gtc caa egg 602 
Leu His Ser His Pro Asp His He Trp Glu Glu Asp Thr Val Gin Arg 
145 150 155 



tgaagtgeag 


tgctgggcce 


tgtggeetga 


cgacteetae 


tggaeeeeat 


acatgaccat 


662 


cgtggectte 


ctggtgtaet 


teateectct 


gacaateate 


ageateatgt 


atggcattgt 


722 


gatccgaact 


atttggatta 


aaagcaaaac 


etacgaaaea gtgatttcea 


aetgetcaga 


782 


tgggaaactg 


tgcageaget 


ataaecgagg 


actcatctea 


aaggcaaaaa 


teaaggctat 


842 


caagtatage 


ateateatea 


ttcttgcett 


eatetgetgt 


tggagtecat 


aettectgtt 


902 


tgaeattttg 


gacaatttea 


aectecttce 


agacaeceag gagcgtttet 


atgeetctgt 


962 


gateatteag 


aacctgecag 


eattgaatag 


tgccatcaac 


cccctcatet 


aetgtgtctt 


1022 


eagcagctce 


atctctttec 


cctgcaggga 


geaaagatea 


eaggatteca 


gaatgacgtt 


1082 


eegggagaga 


aetgagaggc 


atgagatgea 


gattctgtce 


aagccagaat 


tcatctagac 


1142 


cctagggcag 


tgecagtgct 


aggctgagea 


ceatcagete 


teeeaggtee 


ttgtcacetg 


1202 


cttgggeacg 


tgcatggaac 


ecgagceaac 


tteaceeeae 


ectegteatt 


acetgggaga 


1262 


tgeacaagac 


aaatgtteta 


atgactgeat 


geactgetta 


agtattggec 


aacaegaact 


1322 


ecccagttat 


tcatgceagc 


eaggaaggaa 


acgccttcct 


tecceaeeat 


tcceagcect 


1382 



wo 2004/056866 



87/111 



PCT/FI2003/000973 



ccttcccact ggccagcacc tgaacccagt gaacacaggc attagtggtc cagggtcctg 1442 
gcttggagcc agtgagtaga c 14g3 



<210> 11 

<211> 158 

<;212> PRT 

<213> Homo sapiens 



<400> 11 



Met Pro Ala Asn Phe Thr Glu Gly Ser Phe Asp Ser Ser Gly Thr Gly 
^5 10 15 



Gin Thr Leu Asp Ser Ser Pro Val Ala Cys Thr Glu Thr Val Thr Phe 

20 25 30 



Thr Glu Val Val Glu Gly Lys Glu Trp Gly Ser Phe Tyr Tyr Ser Phe 
35 40 45 



Lys Thr Glu Gin Leu He Thr Leu Trp Val Leu Phe Val Phe Thr He 
50 55 60 



Val Gly Asn Ser Val Val Leu Phe Ser Thr Trp Arg Arg Lys Lys Lys 
^5 70 75 80 



Ser Arg Met Thr Phe Phe Val Thr Gin Leu Ala He Thr Gly Cys Ala 

85 90 95 

Ala Leu Arg Leu Tyr Leu Arg Pro Gly Val Pro Gin His Arg Gin He 

100 105 110 



Pro Cys His Arg Leu Pro His Glu Val Pro Ser Arg Arg Lys Ala Ser 
115 120 125 



Gin Gly Pro His Cys Asp Arg Leu Glu Pro Val Phe Ser Val Leu His 
130 135 



Ser His Pro Asp His He Trp Glu Glu Asp Thr Val Gin Arg 
145 150 155 



<210> 12 
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<211> 1473 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> CDS 

<222> (129) . . (536) 



<400> 12 * 

gggctcaggg agggctctgt gcctccgttc agcagagctg cagctgctgc ccagctctca 60 

ggaggcaagc tggactccct cactcagctg caggagcaag gacagtgagg ctcaaccccg 120 

cctgagcc atg cca gcc aac ttc aca gag ggc age ttc gat tec agt ggg 170 
Met Pro Ala Asn Phe Thr Glu Gly Ser Phe Asp Ser Ser Gly 
is 10 

acc ggg cag acg ctg gat tct tec cca gtg get tge act gaa aca gtg 218 
Thr Gly Gin Thr Leu Asp Ser Ser Pro Val Ala Cys Thr Glu Thr Val 
15 20 25 30 

act ttt act gaa gtg gtg gaa gga aag gaa tgg ggt tec ttc tac tac 266 
Thr Phe Thr Glu Val Val Glu Gly Lys Glu Trp Gly Ser Phe Tyr Tyr 

35 40 45 

tee ttt aag act gag caa ttg ata act ctg tgg gtc etc ttt gtt ttt 314 
Ser Phe Lys Thr Glu Gin Leu He Thr Leu Trp Val Leu Phe Val Phe 

50 55 60 

ace att gtt gga aac tec gtt gtg ctt ttt tec aca tgg agg aga aag 362 
Thr He Val Gly Asn Ser Val Val Leu Phe Ser Thr Trp Arg Arg Lys 
65 70 75 

aag aag tea aga atg acc ttc ttt gtg act cag ctg gee ate aca gat 4lo 
Lys Lys Ser Arg Met Thr Phe Phe Val Thr Gin Leu Ala He Thr Asp 
80 85 90 

tct ttc aca gga ctg gtc aac ate ttg aca gat att aat tgg cga ttc 458 
Ser Phe Thr Gly Leu Val Asn He Leu Thr Asp He Asn Trp Arg Phe 
95 100 105 110 

act gga gac ttc acg gca cct gae ctg gtt tge cga gtg gtc cge tat 506 
Thr Gly Asp Phe Thr Ala Pro Asp Leu Val Cys Arg Val Val Arg Tyr 

115 120 125 

ttg cag aaa age aag cca ggg tee tea ttg tgatcgcetg gagcetgtct 556 
Leu Gin Lys Ser Lys Pro Gly Ser Ser Leu 

130 135 

tttetgttct ecatteecae cctgateata tttgggaaga ggacactgtc eaacggtgaa 616 
gtgeagtgct gggeeetgtg gectgaegae tectaetgga ecccatacat gaccategtg 676 
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gccttcctgg tgtacttcat ccctctgaca atcatcagca tcatgtatgg cattgtgatc 
cgaactattt ggattaaaag caaaacctac gaaacagtga tttccaactg ctcagatggg 
aaactgtgca gcagctataa ccgaggactc atctcaaagg caaaaatcaa ggctatcaag 
tatagcatca tcatcattct tgccttcatc tgctgttgga gtccatactt cctgtttgac 
attttggaca atttcaacct ccttccagac acccaggagc gtttctatgc ctctgtgatc 
attcagaacc tgccagcatt gaatagtgcc atcaaccccc tcatctactg tgtcttcagc 
agctccatct ctttcccctg cagggagcaa agatcacagg attccagaat gacgttccgg 
gagagaactg agaggcatga gatgcagatt ctgtccaagc cagaattcat ctagacccta 
gggcagtgcc agtgctaggc tgagcaccat cagctctccc aggtccttgt cacctgcttg 
ggcacgtgca tggaacccga gccaacttca ccccaccctc gtcattacct gggagatgca 
caagacaaat gttctaatga ctgcatgcac tgcttaagta ttggccaaca cgaactcccc 
agttattcat gccagccagg aaggaaacgc cttccttccc caccattccc agccctcctt 
cccactggcc agcacctgaa cccagtgaac acaggcatta gtggtccagg gtcctggctt 
ggagccagtg agtagac 



736 
796 
856 
916 
976 
1036 
1096 
1156 
1216 
1276 
1336 
1396 
1456 
1473 



<210> 13 

<211> 136 

<212> PRT 

<213> Homo sapiens 



<400> 13 

Met Pro Ala Asn Phe Thr Glu Gly Ser Phe Asp Ser Ser Gly Thr Gly 
15 10 15 



Gin Thr Leu Asp Ser Ser Pro Val Ala Cys Thr Glu Thr Val Thr Phe 

20 25 30 



Thr Glu Val Val Glu Gly Lys Glu Trp Gly Ser Phe Tyr Tyr Ser Phe 
35 40 45 



Lys Thr Glu Gin Leu He Thr Leu Trp Val Leu Phe Val Phe Thr He 
50 55 60 



Val Gly Asn Ser Val Val Leu Phe Ser Thr Trp Arg Arg Lys Lys Lys 
" 70 75 80 
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Ser Arg Met Thr Phe Phe Val Thr Gin Leu Ala lie Thr Asp Ser Phe 

85 90 95 



Thr Qly Leu Val Asn He Leu Thr Asp He Asn Trp Arg Phe Thr Gly 

100 105 110 



Asp Phe Thr Ala Pro Asp Leu Val Cys Arg Val Val Arg Tyr Leu Gin 
115 120 125 



Lys Ser Lys Pro Gly Ser Ser Leu 
130 135 



<210> 14 

<211> 1369 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> CDS 

<222> (129) . . (1043) 



<400> 14 

gggctcaggg agggctctgt gcctccgttc agcagagctg cagctgctgc ccagctctca 60 

ggaggcaagc tggactccct cactcagctg caggagcaag gacagtgagg ctcaaccccg 120 

cctgagcc atg cca gcc aac ttc aca gag ggc age ttc gat tec agt ggg 170 
Met Pro Ala Asn Phe Thr Glu Gly Ser Phe Asp Ser Ser Gly 
15 10 

acc ggg cag acg ctg gat tct tec cca gtg get tgc act gaa aca gtg 218 
Thr Gly Gin Thr Leu Asp Ser Ser Pro Val Ala Cys Thr Glu Thr Val 
15 20 25 30 

act ttt act gaa gtg gtg gaa gga aag gaa tgg ggt tec ttc tac tac 266 
Thr Phe Thr Glu Val Val Glu Gly Lys Glu Trp Gly Ser Phe Tyr Tyr 

35 40 45 

tee ttt aag act gag caa ttg ata act ctg tgg gtc etc ttt gtt ttt 314 
Ser Phe Lys Thr Glu Gin Leu He Thr Leu Trp Val Leu Phe Val Phe 

50 55 60 

acc att gtt gga aac tec gtt gtg ctt ttt tec aca tgg agg aga aag 362 
Thr He Val Gly Asn Ser Val Val Leu Phe Ser Thr Trp Arg Arg Lys 
65 70 75 

aag aag tea aga atg acc ttc ttt gtg act cag ctg gcc ate aca gaa 410 
Lys Lys Ser Arg Met Thr Phe Phe Val Thr Gin Leu Ala He Thr Glu 
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80 



85 



90 



aag caa gcc agg gtc etc att gtg ate gee tgg age ctg tct ttt etg 
Lys Gin Ala Arg Val Leu lie Val lie Ala Trp Ser Leu Ser Phe Leu 
95 100 105 110 

ttc tec att ccc ace ctg ate ata ttt ggg aag agg aca ctg tee aac 
Phe Ser lie Pro Thr Leu lie He Phe Gly Lys Arg Thr Leu Ser Asn 

115 120 125 

ggt gaa gtg cag tgc tgg gee etg tgg eet gac gae tee tac tgg aee 
Gly Glu Val Gin Cys Trp Ala Leu Trp Pro Asp Asp Ser Tyr Trp Thr 

130 135 140 

cca tac atg ace ate gtg gee ttc ctg gtg tac ttc ate eet etg aca 
Pro Tyr Met Thr He Val Ala Phe Leu Val Tyr Phe He Pro Leu Thr 
145 150 155 

ate ate age ate atg tat gge att gtg ate cga act att tgg att aaa 
He He Ser He Met Tyr Gly He Val He Arg Thr He Trp He Lys 
160 165 170 



458 



506 



554 



602 



650 



age aaa ace tac gaa aca gtg att tee aac tgc tea gat ggg aaa etg 
Ser Lys Thr Tyr Glu Thr Val He Ser Asn Cys Ser Asp Gly Lys Leu 
175 180 185 190 

tgc age age tat aac cga gga etc ate tea aag gca aaa ate aag get 
Cys Ser Ser Tyr Asn Arg Gly Leu He Ser Lys Ala Lys He Lys Ala 

195 200 205 

ate aag tat age ate ate ate att ctt gee ttc ate tgc tgt tgg agt 
He Lys Tyr Ser He He He He Leu Ala Phe He Cys Cys Trp Ser 

210 215 220 

cca tac ttc etg ttt gac att ttg gac aat ttc aac etc ctt cca gac 
Pro Tyr Phe Leu Phe Asp He Leu Asp Asn Phe Asn Leu Leu Pro Asp 
225 230 235 

ace cag gag egt ttc tat gcc tct gtg ate att cag aac ctg cca gca 
Thr Gin Glu Arg Phe Tyr Ala Ser Val He He Gin Asn Leu Pro Ala 
240 245 250 

ttg aat agt gcc ate aac ccc etc ate tac tgt gtc ttc age age tec 
Leu Asn Ser Ala He Asn Pro Leu He Tyr Cys Val Phe Ser Ser Ser 
255 260 265 270 

ate tct ttc ccc tgc agg gag caa aga tea cag gat tec aga atg acg 
He Ser Phe Pro Cys Arg Glu Gin Arg Ser Gin Asp Ser Arg Met Thr 

275 280 285 

ttc egg gag aga act gag agg cat gag atg cag att etg tee aag cca 
Phe Arg Glu Arg Thr Glu Arg His Glu Met Gin He Leu Ser Lys Pro 

290 295 300 

gaa ttc ate tagaccctag ggcagtgcca gtgetagget gagcaccate 
Glu Phe He 
305 



698 



746 



794 



842 



890 



938 



986 



1034 



1083 



agetctccca ggtccttgtc acctgettgg gcacgtgcat ggaaccegag ecaacttcac 1143 
cccaccctcg teat t ace tg ggagatgcac aagacaaatg ttctaatgae tgcatgcact 1203 
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gcttaagtat tggccaacac gaactcccca gttattcatg ccagccagga aggaaacgcc 1263 
ttccttcccc accattccca gccctccttc ccactggcca gcacctgaac ccagtgaaca 1323 
caggcattag tggtccaggg tcctggcttg gagccagtga gtagac 1369 



<210> 15 

<211> 305 

<212> PRT 

<213> Homo sapiens 



<400> 15 



Met Pro Ala Asn Phe Thr Glu Gly Ser Phe Asp Ser Ser Gly Thr Gly 
15 10 15 



Gin Thr Leu Asp Ser Ser Pro Val Ala Cys Thr Glu Thr Val Thr Phe 

20 25 30 



Thr Glu Val Val Glu Gly Lys Glu Trp Gly Ser Phe Tyr Tyr Ser Phe 
35 40 ' 45 . 



Lys Thr Glu Gin Leu lie Thr Leu Trp Val Leu Phe Val Phe Thr lie 
50 55 60 



Val Gly Asn Ser Val Val Leu Phe Ser Thr Trp Arg Arg Lys Lys Lys 
65 70 75 80 



Ser Arg Met Thr Phe Phe Val Thr Gin Leu Ala He Thr Glu Lys Gin 

85 90 95 



Ala Arg Val Leu He Val He Ala Trp Ser Leu Ser Phe Leu Phe Ser 

100 105 110 



He Pro Thr Leu He He Phe Gly Lys Arg Thr Leu Ser Asn Gly Glu 
115 120 125 



Val Gin Cys Trp Ala Leu Trp Pro Asp Asp Ser Tyr Trp Thr Pro Tyr 
130 135 140 



Met Thr He Val Ala Phe Leu Val Tyr Phe He Pro Leu Thr He He 
145 150 155 160 



Ser He Met Tyr Gly He Val He. Arg Thr He Trp He Lys Ser Lys 
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175 



Thr Tyr Glu Thr Val He Ser Asn Cys Ser Asp Gly Lys Leu Cys Ser 

180 185 190 



Ser Tyr Asn Arg Gly Leu He Ser Lys Ala Lys He Lys Ala He Lys 
195 200 205 



Tyr Ser He He He He Leu Ala Phe He Cys Cys Trp Ser Pro Tyr 
210 215 220 



Phe Leu Phe Asp He Leu Asp Asn Phe Asn Leu Leu Pro Asp Thr Gin 
225 230 235 240 



Glu Arg Phe Tyr Ala Ser Val He He Gin Asn Leu Pro Ala Leu Asn 

245 250 255 



Ser Ala He Asn Pro Leu He Tyr Cys Val Phe Ser Ser Ser He Ser 

260 265 270 



Phe Pro Cys Arg Glu Gin Arg Ser Gin Asp Ser Arg Met Thr Phe Arg 
275 280 285 



Glu Arg Thr Glu Arg His Glu Met Gin He Leu Ser Lys Pro Glu Phe 
290 295 300 



He 
305 



<210> 16 

<211> 1532 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> CDS 

<222> (231) . . (383) 



<400> 16 

gacacagaga agaccactcc cacctccccg agtgcaaggt gtgaagggac agatctttta 60 



accatgcctg cccccttata cttgctgttc atagaattgc aactgaaagt gaccatgagg 120 
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atccactgga tggagttact tctttcttaa gtgaggaggc taagatctgg agtgacttct 180 

ccccagattt ttgtatacct gactctgttt cagcatccgc ttcccaaaga atg cag 236 

Met Gin 
1 

tgt gaa gca gga get tat gtg aga aga aac gca ggg aga cag ttc agt 284 
Cys Glu Ala Gly Ala Tyr Val Arg Arg Asn Ala Gly Arg Gin Phe Ser 
5 10 15 

cac tgc aat ctt cat gcc cat cag ttt ctt gtg aga aga aaa caa gag 332 
His Cys Asn Leu His Ala His Gin Phe Leu Val Arg Arg Lys Gin Glu 
20 25 30 

etc caa tct gtt aga tgg tat ttt gaa gca ggt ctt tgg gta aag gac 380 
Leu Gin Ser Val Arg Trp Tyr Phe Glu Ala Gly Leu Trp Val Lys Asp 
35 40 45 50 

acc tagacccagt gaaggtcatg gtgattatta ttggacaatg ggacatcact 433 

ctgctatttg aacaaataag actttttcct gacatgcatc tggaggcagg tcaaagctcc 493 

aggccaactc caagtttctg atggggtctc tagccaatgg aaggcttctt ctccttcaat 553 

tgcctgactc ttcaggactc ttaaatactg caaagtgaga aaatgagaca ggttgcactg 613 

agggctgtta gccagacaga gtctcgaact ggaagtccat ctagatgttt tgcataagag 673 

aatggaaaca atctgtctgt gatttaggga catactctgg cagcaatatg ggaatacagt 733 

ttcaatcctc attaacaaaa caggtatgaa atacatattt atttagtaag gtgccagctg 793 

tatgaaaaat ccatttctta tttcccataa tgtttctgaa atgtcttagc agtgcataga 853 

gacagcatgt catcattttc tagggactgt gtgttattgc atttttccta gggaagatct 913 

tttctaggtc acctgctcct tcgctaaagc tctgaccaat ctagcttgct aacctgtgac 973 

tccattttcc taagtcctga gagagaaaaa cgctttgcag caaattatgc caggcatcct 1033 

tgtgtctaaa tgaaaaaagg aaaaagcctc cttccttccc tctgttgaga agtgcacggt 1093 

ccacatatgc atgcacagca tatactgtga gggtatttgc agtcccttgg gttgctttga 1153 

taactggcca ggttgctgtt ctatttttcc acattctatt aatcctccta caggcagtta 1213 

ttaggtattg agtgctcaca cacccctggc atagtcacca catgccatta gctccagata 1273 

aacttccaga aaaaagtcca tcccccactt ctctcagctg cctgccaacg ctggacacct 1333 

tctcaccaag ccagcaggac agagaaaagc ctgggcttta agatcaaaca aacacagctt 1393 

caaattagga ctctgtcact tcctgtgtac tgggcacttt gctgagtatg tggtttctca 1453 

tctgtaaaac agagaaagat gattatctcc caatctttct atgttatatg tttgaattaa 1513 

ataaggtact ctccatgaa ,c-an 
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<210> 17 
<211> 51 
<212> PRT 

<213> Homo sapiens 

<400> 17 

Met Gin Cys Glu Ala Gly Ala Tyr Val Arg Arg Asn Ala Gly Arg Gin 
15 10 15 

Phe Ser His Cys Asn Leu His Ala His Gin Phe Leu Val Arg Arg Lys 

20 25 30 

Gin Glu Leu Gin Ser Val Arg Trp Tyr Phe Glu Ala Gly Leu Trp Val 
35 40 45 

Lys Asp Thr 
50 

<210> IB 

<211> 1407 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> CDS 

<222> (244) . . (402) 

<400> 18 

gacacagaga agaccactcc cacctccccg agtgcaaggt gtgaagggac agatctttta 60 

accatgcctg cccccttata cttgctgttc atagaattgc aactgaaagt gaccatgagg 120 

atccactgga tggagttact tctttcttaa gtgaggaggc taagatctga gttcttcaca 180 

tctctctgta gataaaattt ccggtctggt ttcacattcc tctgtcagaa gaactttctt 240 

taa tgt ttc tta aag tac agg tct get get tat gtg aga aga aac gca 288 
Cys Phe Leu Lys Tyr Arg Ser Ala Ala Tyr Val Arg Arg Asn Ala 
15 10 15 

ggg aga cag ttc agt cac tgc aat ctt cat gcc cat cag ttt ctt gtg 336 
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Gly Arg Gin Phe Ser His Cys Asn Leu His Ala His Gin Phe Leu Val 

20 25 30 

aga aga aaa caa gag etc caa tct gtt aga tgg tat ttt gaa gca ggt 384 
Arg Arg Lys Gin Qlu Leu Gin Ser Val Arg Trp Tyr Phe Glu Ala Gly 

35 40 45 

ctt tgg gta aag gac acc tagacccagt gaaggtcatg gtgattatta 432 
Leu Trp Val Lys Asp Thr 
50 



ttggacaatg 


ggacatcact ctgctattaa gtgagaaaat gagacaggtt gcactgaggg 


492 


ctgttagcca 


gacagagtct cgaactggaa gtccatctag atgttttgca taagagaatg 


552 


gaaacaatct 


gtctgtgatt tagggacata ctctggcagc aatatgggaa tacagtttca 


612 


atcctcatta 


acaaaacagg tatgaaatac atatttattt agtaaggtgc cagctgtatg 


672 


aaaaatccat 


ttcttatttc ccataatgtt tctgaaatgt cttagcagtg catagagaca 


732 


gcatgtcatc 


attttctagg gactgtgtgt tattgcattt ttcctaggga agatcttttc 


792 


taggtcacct 


gctccttcgc taaagctctg accaatctag cttgctaacc tgtgactcca 


852 


ttttcctaa.cr 


wvvuu^ci^Giya ^ctctaciacgcu ccgcagcaaa ccacgccagg catccttgtg 


912 


tctaaatgaa 


aaaaggaaaa agcctccttc cttccctctg ttgagaagtg cacggtccac 


972 


atatgcatgc 


acagcatata ctgtgagggt atttgcagtc ccttgggttg ctttgataac 


1032 


tggccaggtt 


gctgttctat ttttccacat tctattaatc ctcctacagg cagttattag 


1092 


gtattgagtg 


ctcacacacc cctggcatag tcaccacatg ccattagctc cagataaact 


1152 


tccagaaaaa 


agtccatccc ccacttctct cagctgcctg ccaacgctgg acaccttctc 


1212 


accaagccag 


caggacagag aaaagcctgg gctttaagat caaacaaaca cagcttcaaa 


1272 


ttaggactct 


gtcacttcct gtgtactggg cactttgctg agtatgtggt ttctcatctg 


1332 


taaaacagag 


aaagatgatt atctcccaat ctttctatgt tatatgtttg aattaaataa 


1392 


ggtactctcc 


atgaa 


1407 



<210> 19 

<211> 53 

<212> PRT 

<213> Homo sapiens 



<400> 19 



Cys Phe Leu Lys Tyr Arg Ser Ala Ala Tyr Val Arg Arg Asn Ala Gly 
15 10 15 
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Arg Gin Phe Ser His Cys Asn Leu Hia Ala Hie Gin Phe Leu Val Arg 

20 25 30 



Arg Lys Qln Qlu Leu Gin Ser Val Arg Trp Tyr Phe Glu Ala Gly Leu 
35 40 45 



Trp Val Lys Asp Thr 
50 



<210> 20 

<211> 341 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> CDS 

<222> (146) . . (328) 



<400> 20 

gctgatggtg gaaggagaat gagtctctga tgcctttgga cttgatgctg gaaagactta 60 

agactttggg ggactactgg aaaggagtga cttctcccca gatttttgta tacctgactc 120 

tgtttcagca tccgcttccc aaaga atg cag tgt gaa gca gga get tat gtg 172 

Met Gin Cys Glu Ala Gly Ala Tyr Val 
1 5 

aga aga aac gca ggg aga cag ttc agt cac tgc aat ctt cat gcc cat 220 
Arg Arg Asn Ala Gly Arg Gin Phe Ser' His Cys Asn Leu His Ala His 
10 15 20 25 



cag ttt ctt gtg aga aga aaa caa gtg gat ata cac tgt tec aag cag 
Gin Phe Leu Val Arg Arg Lys Gin Val Asp He His Cys Ser Lys Gin 

30 35 40 



268 



cat gtg ttg aaa aga ttt gtc ttt tec cca ttt aat ggt ctt ggt acc 316 
His Val Leu Lys Arg Phe Val Phe Ser Pro Phe Asn Gly Leu Gly Thr 

45 50 55 

ttt etc aaa aat tgaccatata tga 34;^ 
Phe Leu Lys Asn 
60 



<210> 21 
<211> 61 
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<400> 21 

Met Qln Cys Glu Ala Gly Ala Tyr Val Arg Arg Asn Ala Gly Arg Gin 
^ . 5 10 15 

Phe Ser His Cys Asn Leu His Ala His Gin Phe Leu Val Arg Arg Lys 

20 25 30 

Gin Val Asp He His Cys Ser Lys Gin His Val Leu Lys Arg Phe Val 
35 40 45 

Phe Ser Pro Phe Asn Gly Leu Gly Thr Phe Leu Lys Asn 

55 60 



<210> 22 

<211> 710 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> CDS 

<222> (175) . . (291) 



<400> 22 

tctaggactc agaaatatag atgttagtaa gagcaaacag acataacaga taacacatac 60 
aaagtgccta ccacatgcta accactgctg caggcacttt ctatagaaga actaatttaa 120 

tcatcaccat aaccctatgg .ggtagatgat atttttacaa cctccatttt acag atg 177 

Met 
1 

aag aaa ctg aag cat aga cct get tat gtg aga aga aac gca ggg aga 225 
Lys Lys Leu Lys His Arg Pro Ala Tyr Val Arg Arg Asn Ala Gly Arg 

5 10 15 

Qln Phe Ser His Cys Asn Leu His Ala His Gin Phe Leu Val Arg A?q 
20 25 30 

aaa caa gaa aac aag gac tgaaatccac acaggaaggt ggcagtgaac 321 
Lys Gin Glu Asn Lys Asp 
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35 

tccacagacg gacctggacg cctcaacact cctggcctta cctcccttgc tgaacgtctc 381 

aagtttctct gcgttcagga ctggcaacgc ctgcttcctc ctctgagctg tcaagtagga 441 

agtccgggct gctctgctag aaagagaagt catgtgcagg agcactgagg catcccaggt 501 

gtgacactct tccacctaga gcattccgtc tctcatcctc tgccatgtga cgctgggctt 561 

ctttaacaaa ttaatcccaa gtgcaagaca tttatttctt ctgtacctaa tgacctgagc 621 

aatccttctc tgctgaacct ggtagtgtca tctttagaag tgaagacaca attaacacat 681 

ggtcatttct tcattatatc gttgttact ^^.o 

<210> 23 

<211> 39 

<212> PRT 

<213> Homo sapiens 



<400> 23 



Met Lys Lye Leu Lys His Arg Pro Ala Tyr Val Arg Arg Asn Ala Gly 

Arg Gin Phe Ser His Cys Asn Leu His Ala His Gin Phe Leu Val Arg 

20 25 30 



Arg Lys Gin Glu Aen Lys Asp 
35 



<210> 24 

<211> 949 

<212> DNA 

<213> Homo sapiens 

o 

<220> 

<221> CDS 

<222> (175) . . (366) 



<400> 24 

tctaggactc agaaatatag atgttagtaa gagcaaacag acataacaga taacacatac 
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aaagtgccta ccacatgcta accactgctg caggcacttt ctatagaaga actaatttaa 120 

tcatcaccat aaccctatgg ggtagatgat atttttacaa cctccatttt acag atg 177 

Met 
1 

aag aaa ctg aag cat aga cct get tat gtg aga aga aac gca ggg aga 225 
Lys Lys Leu Lye His Arg Pro Ala Tyr Val Arg Arg Aan Ala Gly Arg 

S 10 15 

f^^ ""f" ""^^ ^^'^ ^^"^ ttt ctt gtg aga aga 273 

Gin Phe Ser His Cys Asn Leu Hie Ala His Gin Phe Leu Val Arg Aro 
20 25 30 

aaa caa gac tgg caa cgc ctg ctt cct cct ctg age tgt caa gta gga 321 
Lys Gin Asp Trp Gin Arg Leu Leu Pro Pro Leu Ser Cys Gin Val Gly 
35 40 45 

agt ccg ggc tgc tct get aga aag aga agt cat gtg cag gag cac 366 
Ser Pro Gly Cys Ser Ala Arg Lys Arg Ser His Val Gin Glu His 
50 55 60 

tgaggcatcc caggtgtgac actcttccao ctagagcatt ccgtctctca tcctctgcca 426 

tgtagcaaac tgctatgcat ccttcagctg caagggattg aatgctatca aoaaccatac 486 

aagtggagaa gcagatgctt ccctagctga gcotcaggct ttttgatgga attgctacaa 546 

cttggtgcat gcctgotcct aaaagaaata ctcaggaatt gtctcataaa gtcctcacct 606 

actggcaaaa acaagatgtt ctactcccag gttgactttt tcaagcccca agatgttgag 666 

tcagccattc tccaaggatc tcgatttcct tttaatggaa aataacatta aacaccaaat 726 

ataagcctcg ctgtcccaoa tgcgtattgg ggacaagatg aaacctgctt ccaggctact 786 

ttggcagcag aactgaaaaa ggcttttttt ccagatatat gatttctcat cgacagggtt 846 

goacagccct ctttattgtt cgtgtaaatg acacccttgg atctgaacaa tacacacoag 906 
gacaattgtg tgoaacagtt ctacaaactg atatttctaa tta 



<210> 25 

<211> 64 

<212> PRT 

<213> Homo sapiens 



<400> 25 



Met Lys Lys Leu Lys His Arg Pro Ala Tyr Val Arg Arg Asn Ala Gly 
^5 10 15 

Arg Gin Phe Ser His Cys Asn Leu His Ala His Gin Phe Leu Val Arg 



949 



wo 2004/056866 PCT/FI2003/000973 
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20 25 30 



Arg Lys Gin Asp Trp Gin Arg Leu Leu Pro Pro Leu Ser Cys Gin Val 
35 40 45 

Gly Ser Pro Gly Cys Ser Ala Arg Lys Arg Ser His Val Gin Glu His 
50 55 60 

<210> 26 

<211> 1068 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> CDS 

<222> (176) . . (367) 



<400> 26 

gtctaggact cagaaatata gatgttagta agagcaaaca gacataacag ataacacata 60 
caaagtgcct accacatgct aaccactgct gcaggcactt tctatagaag aactaattta 120 

atcatcacca taaccctatg gggtagatga tatttttaca acctccattt tacag atg 178 

Met 
1 

aag aaa ctg aag cat aga cct get tat gtg aga aga aac gca ggg aga 226 
Lys Lys Leu Lys His Arg Pro Ala Tyr Val Arg Arg Asn Ala Gly Arg 

5 10 15 

cag ttc agt cac tgc aat ctt cat gcc cat cag ttt ctt gtg aga aga 274 
Gin Phe Ser His Cys Asn Leu His Ala His Gin Phe Leu Val Arg Arg 
20 25 30 

aaa caa gac tgg caa cgc ctg ctt cct cct ctg age tgt caa gta gga 322 
Lys Gin Asp Trp Gin Arg Leu Leu Pro Pro Leu Ser Cys Gin Val Gly 
35 40 45 

agt ccg ggc tgc tct get aga aag aga agt cat gtg cag gag cac 367 
Ser Pro Gly Cys Ser Ala Arg Lys Arg Ser His Val Gin Glu His 
50 55 60 

tgaggcatcc caggtgtgac actcttccac ctagagcatt ccgtctctca tcctctgcca 427 

tgtagcaaac tgctatgcat ccttcagctg caagggattg aatgctatca acaaccatac 487 

aagtggagaa gcagatgctt ccctagctga gcctcagctc tgttttcctc agtcctaggg 547 

gcagattcta atttggtaca gaaaaaaaaa gfcactcctgc caattcccat cctggaatga 607 



wo 2004/056866 



PCT/FI2003/000973 
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ctgaagaaaa gaaaatgcat gtgattattt taaaggcttt ttgatggaat tgctacaact 667 

tggtgcatgc ctgctcctaa aagaaatact caggaattgt ctcataaagt cctcacctac 727 

tggcaaaaac aagatgttct actcccaggt tgactttttc aagccccaag atgttgagtc 787 

agccattctc caaggatctc gatttccttt taatggaaaa taacattaaa caccaaatat 847 

aagcctcgct gtcccacatg cgtattgggg acaagatgaa acctgcttcc aggctacttt 907 

ggcagcagaa ctgaaaaagg ctttttttcc agatatatga tttctcatcg acagggttgc 967 

acagccctct ttattgttcg tgtaaatgac acccttggat ctgaacaata cacaccagga 1027 

caattgtgtg caacagttct acaaactgat atttctaatt a 1068 



<210> 27 

<211> 64 

<212> PRT 

<213> Homo sapiens 



<400> 27 

Met Lys Lys Leu Lys His Arg Pro Ala Tyr Val Arg Arg Asn Ala Gly 
1 5 10 15 



Arg Gin Phe Ser His Cys Asn Leu His Ala His Gin Phe Leu Val Arg 

20 25 30 



Arg Lys Gin Asp Trp Gin Arg Leu Leu Pro Pro Leu Ser Cys Gin Val 
35 40 45 



Gly Ser Pro Gly Cys Ser Ala Arg Lys Arg Ser His Val Gin Glu His 
50 55 60 



<210> 28 



<211> 799 



<212> DNA 



<213> Homo sapiens 



<220> 

<221> CDS 



wo 2004/056866 
<222> (175) . . (363) 
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<400> 28 

tctaggactc agaaatatag atgttagtaa gagcaaacag acataacaga taacacatac 60 

aaagtgccta ccacatgcta accactgctg caggcacttt ctatagaaga actaatttaa 120 

* 

tcatcaccat aaccctatgg ggtagatgat atttttacaa cctccatttt acag atg 177 

Met 
1 

aag aaa ctg aag cat aga cct get tat gtg aga aga aac gca ggg aga 225 
Lys Lys Leu Lys His Arg Pro Ala Tyr Val Arg Arg Asn Ala Gly Arg 

5 10 15 

cag ttc agt cac tgc aat ctt cat gcc cat cag ttt ctt gtg aga aga 273 
Gin Phe Ser Hxs Cys Asn Leu His Ala His Gin Phe Leu Val Arg Arg 
20 25 30 

aaa caa age aaa ctg eta tgc ate ctt cag ctg caa ggg att gaa tgc 321 
Lys Gin Ser Lys Leu Leu Cys He Leu Gin Leu Gin Gly He Glu Cys 
35 40 45 

tat caa caa cca tac aag tgg aga age aga tgc ttc cct age 363 
Tyr Gin Gin Pro Tyr Lys Trp Arg Ser Arg Cys Phe Pro Ser 
50 55 60 

tgagcctcag getttttgat ggaattgeta caacttggtg catgcctgct ectaaaagaa 423 

atactcagga attgtctcat aaagtcctea cctactggca aaaacaagat gttctactcc 483 

caggttgact ttttcaagee ccaagatgtt gagtcagcea ttctccaagg atctcgattt 543 

ccttttaatg gaaaataaca ttaaacacca aatataagcc tegctgtcce acatgcgtat 603 

tggggacaag atgaaacetg ettccaggct actttggcag cagaactgaa aaaggctttt 663 

tttccagata tatgatttet catcgacagg gttgcacagc cctctttatt gttcgtgtaa 723 

atgacaccct tggatctgaa caatacacac caggacaatt gtgtgcaaca gttctaeaaa 783 
ctgatatttc taatta 



<210> 29 

<211> 63 

<212> PRT 

<213> Homo sapiens 



799 



<400> 29 




wo 2004/056866 
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Arg Gin Phe Ser His Cys Asn Leu His Ala His Gin Phe Leu Val Arg 

20 25 30 

Arg Lys Gin Ser Lys Leu Leu Cys lie Leu Gin Leu Gin Gly lie Glu 
35 40 45 

Cys Tyr Gin Gin Pro Tyr Lys Trp Arg Ser Arg Cys Phe Pro Ser 
50 55 60 



<210> 30 

<211> 834 

<212> ONA 

<213> Homo sapiens 



<220> 

<221> CDS 

<222> (30).. (251) 



149 



<400> 30 

gctgatggtg gaaggagaat gagtctctg atg cct ttg gac ttg atg ctg gaa 53 

Met Pro Leu Asp Leu Met Leu Glu 
1 5 

aga ctt aag act ttg ggg gac tac tgg aaa get tat gtg aga aaa aae loi 
Arg Leu Lys Thr Leu Gly Asp Tyr Tr| Lys Ila Tyr ?2 S ^ tin 
10 15 20 

gca ggg aga cag ttc agt cac tgc aat ctt cat gcc cat cag ttt ctt 
Ala Gly Arg Gin Phe Ser His Cys Asn Leu His Ala His Gin Phe Leu 
" 30 35 40 

v!? t^^ t^^ f^^ '^f^ caa cgc ctg ctt cct cct ctg age tgt 197 

Val Arg Arg Lys Gin Asp Trp Gin Arg Leu Leu Pro Pro LeS Ser Cys 

45 50 55 

fJ^. H?^ ^9t cat gtg cag 245 

Gin val Gly Ser Pro Gly Cys Ser Ala Arg Lys Arg Ser His Val Gin 

60 65 70 

Qll His caggtgtgac actcttccac ctagagcatt ccgtetctoa 301 

tcctctgcca tgtagoaaac tgctatgcat ccttcagctg caagggattg aatgctatca . 361 
acaaccatac aagtggagaa gcagatgctt ccctagctga gcctcaggct ttttgatgga 421 
attgctacaa cttggtgcat gcctgctcct aaaagaaata ctoaggaatt gtctcataaa 481 
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gtcctcacct actggcaaaa acaagatgtt ctactcccag gttgactttt tcaagcccca 541 

agatgttgag tcagccattc tccaaggatc tcgatttcct tttaatggaa aataacatta 601 

aacaccaaat ataagcctcg ctgtcccaca tgcgtattgg ggacaagatg aaacctgctt 661 

ccaggctact ttggcagcag aactgaaaaa ggcttttttt ccagatatat gatttctcat 721 

cgacagggtt gcacagccct ctttattgtt cgtgtaaatg acacccttgg atctgaacaa 781 

tacacaccag gacaattgtg tgcaacagtt ctacaaactg atatttctaa tta 834 

<210> 31 
<211> 74 
<212> PRT 

<213> Homo sapiens 



<400> 31 



Met Pro Leu Asp Leu Met Leu Glu Arg Leu Lys Thr Leu Gly Asp Tyr 
1 5 10 15 



Trp Lys Ala Tyr Val Arg Arg Asn Ala Gly Arg Gin Phe Ser His Cys 

20 25 30 



Asn Leu His Ala His Gin Phe Leu Val Arg Arg Lys Gin Asp Trp Gin 
35 40 45 



Arg Leu Leu Pro Pro Leu Ser Cys Gin Val Gly Ser Pro Gly Cys Ser 
50 55 60 



Ala Arg Lys Arg Ser His Val Gin Glu His 
65 70 



<210> 32 

<211> 550 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> CDS 



wo 2004/056866 
<222> (175) . . (366) 
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<400> 32 

tctaggactc agaaatatag atgttagtaa gagcaaacag acataacaga taacacatac 60 

aaagtgccta ccacatgcta accactgctg caggcacttt ctatagaaga actaatttaa 120 

tcatcaccat aaccctatgg ggtagatgat atttttacaa cctccatttt acag atg 177 

Met 
1 

aag aaa ctg aag cat aga cct get tat gtg aga aga aac gca ggg aga 225 
Iiys hya Leu Lys His Arg Pro Ala Tyr Val Arg Arg Asn Ala Gly Arg 

5 10 15 

cag ttc agt cac tgc aat ctt cat gcc cat cag ttt ctt gtg aga aga 273 
Qln Phe Ser His Cys Asn Leu His Ala His Gin Phe Leu Val Arg Arg 
20 25 30 

aaa caa gac tgg caa cgc ctg ctt cct cct ctg age tgt caa gta gga 321 
Lys Gin Asp Trp Gin Arg Leu Leu Pro Pro Leu Ser Cys Gin Val Gly 
35 40 45 

agt ccg ggc tgc tct get aga aag aga agt cat gtg cag gag cac 366 
Ser Pro Gly Cys Ser Ala Arg Lys Arg Ser His Val Gin Glu His 
50 55 60 

tgaggcatcc caggtgtgac actcttccac ctagagcatt ccgtctctca tcctctgcca 426 
tgtgccatgt tttgaaccac tagattagag ggtcaagcaa tttcttggaa ttttactctg 486 
aattctacgt agaccatttt catgtgtata cctcctctga gtcaccctca ggtagggaca 546 
tttt 



<210> 33 

<211> 64 

<212> PRT 

<213> Homo sapiens 



<400> 33 



Met Lys Lys Leu Lys His Arg Pro Ala Tyr Val Arg Arg Asn Ala Gly 
1 5 10 . 15 



Arg Gin Phe Ser His Cys Asn Leu His Ala His Gin Phe Leu Val Arg 

20 25 30 



550 



Arg Lys Gin Asp Trp Gin Arg Leu Leu Pro Pro Leu Ser Cys Gin Val 
35 40 45 
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Gly Ser Pro Qly Cys Ser Ala Arg Lys Arg Ser His Val Gin Glu His 
50 55 go 



<210> 34 

<211> 376 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> CDS 

<222> (175) . . (363) 



<400> 34 

tctaggactc agaaatatag atgttagtaa gagcaaacag acataacaga taacacatac 60 
aaagtgccta ccacatgcta accactgctg caggcacttt ctatagaaga actaatttaa 120 

tcatcaccat aaccctatgg ggtagatgat atttttacaa cctccatttt acag atg 177 

Met 
1 

aag aaa ctg aag cat aga cct get tat gtg aga aga aac gca ggg aga 225 
Lys Lys Leu Lys His Arg Pro Ala Tyr Val Arg Arg Asn Ala Gly Arg 

5 10 15 

cag ttc agt cac tgc aat ctt cat gcc cat cag ttt ctt gtg aga aga 273 
Gin Phe Ser His Cys Asn Leu His Ala His Gin Phe Leu Val Arq Arq 
20 25 30 

aaa caa gtg gat at a cac tgt tec aag cag cat gtg ttg aaa aga ttt 321 
Lys Gin Val Asp lie His Cys Ser Lys Gin His Val Leu Lys Arg Phe 
35 40 45 

gtc ttt tec cca ttt aat ggt ctt ggt acc ttt etc aaa aat 363 
Val Phe Ser Pro Phe Asn Gly Leu Gly Thr Phe Leu Lys Asn 
50 55 gQ 

tgaccatata tga 

<210> 35 

<211> 63 

<212> PRT 

<213> Homo sapiens 
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<400> 35 



Met Lys Lys Leu Lys His Arg Pro Ala Tyr Val Arg Arg Asn Ala Gly 
^5 10 15 



Arg Gin Phe Ser His Cys Asn Leu His Ala His Gin Phe Leu Val Arg 

20 25 30 



Arg Lys Gin Val Asp He His Cys Ser Lys Gin His Val Leu Lys Arg 
35 40 45 



Phe Val Phe Ser Pro Phe Asn Gly Leu Gly Thr Phe Leu Lys Asn 
50 55 60 



<210> 36 

<211> 369 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> CDS 

<222> (146) . . (247) 



<400> 36 

gctgatggtg gaaggagaat gagtctctga tgcctttgga cttgatgctg gaaagactta 60 

agactttggg ggactactgg aaaggagtga cttctcccca gatttttgta tacctgactc 120 

tgtttcagca tccgcttccc aaaga atg cag tgt gaa gca gga get tat gtg 172 

Met Gin Cys Glu Ala Gly Ala Tyr Val 
1 5 

aga aga aac gca ggg aga cag ttc agt cac tgc aat ctt cat gcc cat 220 
Arg Arg Asn Ala Gly Arg Gin Phe Ser His Cys Asn Leu His Ala His 
10 15 20 25 

cag ttt ctt gtg aga aga aaa caa gtt taggaaaact tcctacacct 267 
Gin Phe Leu Val Arg Arg Lys Gin Val 

30 

tctttgttgg gatgttctct ggactaatga ctccaggcga gaccaccgtt gatcatgaac 327 
tcactttgaa acagaagctg ggttggtaag actggagcta ct 369 



<210> 37 
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<211> 34 
<212> PRT 

<213> Homo sapiens 



<400> 37 



Met Gin Cys Glu Ala Gly Ala Tyr Val Arg Arg Asn Ala Gly Arg Gin 
^5 10 15 

Phe Ser His Cys Asn Leu His Ala His Gin Phe Leu Val Arg Arg Lys 

20 25 30 



Gin Val 



<210> 38 

<211> 512 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> CDS 

<222> (62)., (172) 



<400> 38 

gagtgacttc tccccagatt tttgtatacc tgactctgtt tcagcatccg cttcccaaag 60 



a atg cag tgt gaa gca gga get tat gtg aga aga aac gca ggg aga cag 
Met Gin Cys Glu Ala Gly Ala Tyr Val Arg Arg Asn Ala Gly Arg Gin 
^5 10 15 



109 



ttc agt cac tgc aat ctt cat gcc cat cag ttt ctt gtg aga aga aaa 157 
Phe Ser His Cys Asn Leu His Ala His Gin Phe Leu Val Arg Arg Lys 

20 25 30 

caa gaa aac aag gac tgaaatccac acaggaaggt ggcagtgaac tccacagacg 212 
Gin Glu Asn Lys Asp 
35 

gacctggacg cctcaacact cctggcctta cctcccttgc tgaacgtctc aagtttctct 272 

gcgttcaggt aatgtatagg agggttatga gggcagagaa ttcctaagct cattagtaaa 332 

ttgctcttca gaaaagtgct ttgaagcaaa gctaatttcc tttcccaata tgagaagatt 392 
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tggccctacc agaaaaagga aatgatttga atgtgcgcca aaaaatatgt tttctttctt 452 
ttctttgttg aacactcatc gggagttact cttattagtt ccgcattttt attgccattt 512 

<2lo> 39 
<211> 37 

< 

<212> PRT 
<213> Homo sapiens 

<400> 39 

Met Gin Cys Glu Ala Gly Ala Tyr Val Arg Arg Asn Ala Gly Arg Gin 
15 10 15 

Phe Ser His Cys Asn Leu His Ala His Gin Phe Leu Val Arg Arg Lys 

20 25 30 

Gin Glu Asn Lys Asp 
35 

<210> 40 
<211> 830 
<212> ONA 
<213> Homo sapiens 

<220> 

<221> CDS 

<222> (62) . . (247) 

<400> 40 

gagtgacttc tccccagatt tttgtatacc tgactctgtt tcagcatccg cttcccaaag 

a atg cag tgt gaa gca gga get tat gtg aga aga aac gca ggg aga cag 
Met Gin Cys Glu Ala Gly Ala Tyr Val Arg Arg Asn Ala Gly Arg Gin 
15 10 15 

ttc agt cac tgc aat ctt cat gcc cat cag ttt ctt gtg aga aga aaa 
Phe Ser His Cys Asn Leu His Ala His Gin Phe Leu Val Arg Arg Lys 

20 25 30 

caa gac tgg caa cgc ctg ctt cct cct ctg age tgt caa gta gga agt 
Gin Asp Trp Gin Arg Leu Leu Pro Pro Leu Ser Cys Gin Val Gly Ser 
35 40 45 



60 
109 

157 

205 
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ccg ggc tgc tct get aga aag aga agt cat gtg cag gag cac 247 
Pro Gly Cys Ser Ala Arg Lys Arg Ser His Val Gin Glu His 
50 55 60 

tgaggcatcc caggtgtgac actcttccac ctagagcatt ccgtctctca tcctctgcca 307 

tgtagcaaac tgctatgcat ccttcagctg caagggattg aatgctatca acaaccatac 367 

aagtggagaa gcagatgctt ccctagctga gcctcaggct ttttgatgga attgctacaa 427 

cttggtgcat gcctgctcct aaaagaaata ctcaggaatt gtctcataaa gtcctcacct 487 

actggcaaaa acaagatgtt ctactcccag gttgactttt tcaagcccca agatgttgag 547 

tcagccattc tccaaggatc tcgatttcct tttaatggaa aataacatta aacaccaaat 607 

ataagcctcg ctgtcccaca tgcgtattgg ggacaagatg aaacctgctt ccaggctact 667 

ttggcagcag aactgaaaaa ggcttttttt ccagatatat gatttctcat cgacagggtt 727 

gcacagccct ctttattgtt cgtgtaaatg acacccttgg atctgaacaa tacacaccag 787 

gacaattgtg tgcaacagtt ctacaaactg atatttctaa tta 830 

<210> 41 

<211> 62 

<212> PRT 

<213> Homo sapiens 



<400> 41 

Met Gin Cys Glu Ala Gly Ala Tyr Val Arg Arg Asn Ala Gly Arg Gin 
^5 10 15 

Phe Ser His Cys Asn Leu His Ala His Gin Phe Leu Val Arg Arg Lys 

20 25 30 

Gin Asp Trp Gin Arg Leu Leu Pro Pro Leu Ser Cys Gin Val Gly Ser 
35 40 45 



Pro Gly Cys Ser Ala Arg Lys Arg Ser His Val Gin Glu His 
50 55 60 
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Box No. I Nucleotide and/or amino acid sequence(s) (Continuation of iteml.b of the first slieet) 



1 . With regard to any nucleotide and/or amino acid sequence disclosed in the international application and necessaiy to the claimed 
invention, the international search was carried out on die basis of: 

a. type of material 

^ a sequence listing 

table(s) related to the sequence listing 



b. format of material 

in written format 
in computer readable form 




c. time of filing/furnishing 

contained in the international application as filed 
filed together with the international application in computer readable form 
I ] furnished subsequently to this Audiority for the purposes of search 



2. 



In addition, in the case that more than one version oi* copy of a sequence listing and/or table relating thereto has been filed 
OP fumished,.the required statements that the information 4n the.subsequent or additional copies-is identical to^at .in the . 
application as filed or does not go b^ond the application as filed, as appropriate, were furnished 



3. Additional comments: 
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Box No. II Observations where certain claims were found unsearchable (Continuation of item 2 of first sheet) 


This international search report has not been established in respect of certain claims under Article 17(2)(a) for the following reasons: 


1. 


El 

ft 


Claims Nos.: 25-31 and 34-46 partially 

because they relate to subject matter not required to be searched by this Authority, namely: 

see extra sheet 

• 


2. 




Claims Nos.: 5 and 15 

because they relate to parts of the international application that do not comply with llie prescnbed reqmrements to such an 
extent that no meaningful International search can be carried out, specifically: 

see extra sheet 


3. 


□ 


* 

4 

Claims Nos.: 

because diey are dependent claims and aie not drafted in accordance with (tie second and third sentences of lUile 6.4(a). 


Box No. ni Observations where unity of Invention Is lacking (Continuation of item 3 of first sheet) 


This International Searching Authority found multiple inventions in this international application, as follows: 

■ 

• 


1. 




t 

• 

As all required additional search fees were timely paid by the applicant, this international search report covers all searchable 
claims. 


2. 


□ 


As all searchable claims could be searched without effort justifying an additional fee, this Authority did not invite payment of 
any additional fee. 


3. 


□ 


As only some of the required additional search fees were timely paid by the applicant, this international seardi report covers 

• 

only those claims for which fees were paid, specifically claims Nos.: 


4. 


□ 


• 

No required additional search fees were timely paid by the applicant Consequently, this international search repoct is 
restricted to the invention first mentioned in the claims; it is covered by claims Nos.: 


• • 
Remark on Protest | 1 The additional search fees were accompanied by the applicant's protest 






I [ No protest accompanied the payment of additional search fees. 
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<Box II.1> . 

Claims 25-31 relate to methods of treatment of the human or animal 
body by surgery or by therapy or diagriostic methods practiced on 
the human or animal body (PCT Rule 39.1(iv)). Nevertheless, a 
search has been executed for these claims. The search has been 
based on the alleged effects of the compounds or conqpositions and 
on the polypeptides per se. 

Claims 34-46 xaay relate to methods of treatment of the human or 
animal body by surgery or by therapy or diagnostic methods 
practiced on the human or animal body (PCT Rule 39.1(iv)) since it 
is not clear whether the method is performed in vitro or in vivo. 
Nevertheless, a search has been executed for these claims. The 
search has been based on the asstmiption that the method is 
performed entirely In vitro. 



<Box II. 2> 

Present claims 5 and 15 relate to an extremely large number of 
possible polypeptides. Support within the meaning of Article 6 PCT 
and disclosure within the meaning of Article 5 PCT is to be found, 
however, for only a very small proportion of the polypeptides 
claimed. In the present case, the claims so lack support, and the 
application so lacks disclosure, that a meaningful search over the 
whole of the claimed scope is impossible. 

Consequently, the search has been focused on those parts of the 
claims which appear to be supported and disclosed, namely those 
parts related to the sequences S£Q ID NOs: 3, 5, 7, 9, 11, 13 and 
15 and related sequences including the variants shown in Table 7. 
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